Whither (wither?) tenure?



The system of tenure for university faculty in the
United States arose in the late 19th to early 20th
centuries to guarantee that academics would not
be capriciously dismissed if they conducted research
on controversial topics, pursued “unpromising”
research, or did not conform to conventional
beliefs. Much in the world has changed in the
past 100 years, from the demographics of the academic
workforce and the scale of the educational enterprise,
to the pace of discovery and the legal statutes
surrounding retirement. These changes prompt a
reexamination: Is tenure the best way to nurture
scholarly growth and academic freedom, or has its
cost become too much to bear?

Fortunately, the makeup 
of university faculty has 
changed over the past 100 
years as well. Today, the proportion
of women scientists as professors has grown.
However, they are still underrepresented
among tenured faculty as compared
to, for example, the number 
of women in similar posi- 
tions that do not require 
tenure, such as government 
scientists or university lecturers. A major reason is that 
young academics must concentrate on their careers to 
earn tenure at the same time as they would be starting 
their families, and this issue affects women disproportionately.
The year that I came up for tenure at the
Massachusetts Institute of Technology, I was told that no 
woman who had taken time out for childbirth had ever 
been granted tenure. My twins were born on Monday; I 
was back in the office on Thursday. Whether women 
see the tenure hurdle and opt out for family instead, 
or just never opted in to begin with, the result is 
that there are too few women for a diverse academic 
enterprise, and if this process does not evolve, how can 
the highest institutes of learning promote academic 
freedom and progress? 

Because higher education is essential for the best em- 
ployment prospects, colleges and universities have ex- 
panded to meet the demand. However, the growth has 
not been primarily in tenure-track faculty, but rather in
lecturers, adjunct professors, and staff who generally experience
lower pay and benefits, are excluded from university
governance, and survive on short-term contracts.
Universities have found that this approach enables facile 
changes that complement the teaching staff as enrollment 
shifts and needs change. Nimble faculty can also change 
with the times, but not all tenured faculty are motivated 
to stay abreast of new developments. What might have 
been a booming job market 
20 years ago when a fac- 
ulty member earned tenure 
may be entirely moribund 
now. In some countries, 
such as the United States, 
the problem of obsolescence 
is exacerbated by the fact 
that retirement is no longer 
mandated at a certain age. 
Average life expectancy has 
increased since tenure was 
first instituted. Today, ten- 
ured professors can continue 
to hold their positions 40 to 
50 years past the date when 
they received tenure. 

Revising the tenure sys- 
tem to a more flexible form 
of employment is not going 
to be easy. Those in a posi- 
tion to change the system 
are the ones who person- 
ally benefited from it. Those 
hurt by the system are powerless. But it’s time for 
universities to discuss unilateral action and institute 
some other mechanism. For example, promotion to as- 
sociate professor could be rewarded with a longer-term 
contract (10 years), followed by a series of renewable 
10-year contracts (or in rare cases, longer contracts) 
as a full professor. The contracts would be nonbind- 
ing, giving the faculty member flexibility to consider 
opportunities at other institutions. Such a change 
would encourage faculty to experience the intellectual 
boost that comes from interaction with different col- 
leagues at a new university and would also facilitate 
institutional renewal. An appeals process (through a 
national university association) could adjudicate con- 
tract disputes or cases of dismissal on grounds of intel- 
lectual disagreements. 

The long-standing debate on tenure has not yet re- 
sulted in any serious push to change things. At what cost? 

- Marcia McNutt 








Marcia McNutt 
Editor-in-Chief
“Froome’s values are close to what we
believe are the upper limits for VO2 peak in humans.”

Phillip Bell of the GlaxoSmithKline Human Performance Lab, to Esquire, 
on measuring oxygen uptake by repeat Tour de France winner Chris Froome, who is 
regularly accused of doping or using illegal bikes. 




Hawaii telescope in legal limbo 



Construction of the Thirty Meter Telescope (here in an artist’s
rendering) has been halted indefinitely.



The effort to build the largest optical observatory in the United
States, the Thirty Meter Telescope (TMT) in Hawaii, was halted 
indefinitely on 2 December after the state’s supreme court ruled 
that the project’s building permits were invalid. Chief Justice 
Mark Recktenwald decided that the state Board of Land and 
Natural Resources (BLNR) had wrongly approved the permits 
in 2011 before a contested case hearing could be resolved, thus not 
allowing opponents due process. The decision supersedes a lower 
court’s ruling in 2014 that upheld the permits. The BLNR must now 
allow the contested case hearing to be held before a new permit is 
issued. David Callies, a law professor at the University of Hawaii, 
Manoa, says opponents will likely stretch out the process with new 
litigation. “In that case, you’re talking more than 1 or 2 years,” he 
says. Native Hawaiian opponents of the project say the 18-story TMT 
structure at the summit of the Mauna Kea volcano would be further 
desecration of one of their most sacred places. Protesters have been 
blocking construction workers’ access to the site since March; about 
70 people have been arrested. 



AROUND THE WORLD 

Reproducibility study trimmed 

CHARLOTTESVILLE, VIRGINIA, AND PALO ALTO, CALIFORNIA | An ambitious effort
to check the reproducibility of 50 top 
papers in cancer biology is scaling back 
because of higher than expected costs. The 
Reproducibility Project: Cancer Biology, 
run by the Center for Open Science in 
Charlottesville, Virginia, and Science 
Exchange in Palo Alto, California, set 
out 2 years ago to replicate key experi- 
ments from highly cited cancer papers 
published from 2010 to 2012 (Science,
25 June, p. 1411). But the $1.3 million it
had from a foundation to redo the stud- 
ies was not enough, and last week the 
project’s managers noted online that 13 of 
the 50 replications have been put on hold. 
Most involve costly animal experiments. If 
funding can be found, efforts to reproduce 
the studies will resume. 

Progress on Red-Dead conduit 

AMMAN | Israel and Jordan last week
issued requests for proposals to build an 
$800 million water project that would 
include a pipeline linking the Red Sea and 
the endangered Dead Sea (below). The 
project, which has drawn criticism from 
environmental scientists, has been on the 
drawing board for 3 decades and is slated 
to be completed around 2020. Jordanian 
water and irrigation minister Hazim 
El-Nasser and Israeli Vice Prime Minister 
Silvan Shalom unveiled the project,
intended to provide potable water to the 
region. Planners envision a desalination 
plant in Aqaba, Jordan, that will have an 
initial capacity of 65 million cubic meters 
of water per year. Jordan and Israel will 
share that water, and Shalom believes the 
project can also help “save” the Dead 
Sea, where levels are dropping by about 
1 meter per year, by transporting in brine 
from the Red Sea. But critics fear miner- 
als in the transported water could harm 
the fragile Dead Sea ecosystem, or that 
pumping could affect organisms in the Red 
Sea. Advocates for Palestinian farmers also 
object to the exclusion of the Palestinian 
National Authority from the deal. 

Japan defends whaling plan 

TOKYO | Japan has resumed its controversial
lethal research whaling in the Southern
Ocean, because it wants to determine how 
many minke whales can be harvested 
sustainably and also study the Antarctic 
environment, Joji Morishita, the nation’s 
representative to the International Whaling 
Commission (IWC), told a press confer- 
ence 7 December. “We did our best to try 
to meet the criteria established by the ICJ 
[International Court of Justice] and we have 
decided to implement our research plan 
because we are confident we have com- 
pleted our scientific homework,” Morishita 
said. In March 2014, the ICJ had ordered 
the nation to halt its research whaling, 
ruling that its program at the time, which 
sought to take 850 minke whales, 50 fin 
whales, and 50 humpback whales, did not 
meet stipulated standards of whaling for 
purposes of scientific research. The country 
unveiled a new program in November 2014 
that calls for taking 333 minke whales;
IWC’s Scientific Committee examined the
new program but last June reported that it 
could not reach a consensus. 

Virus may cause birth defects 

BRASILIA | The Zika virus, an emerging
virus that is causing an unprecedented 
epidemic in Brazil and is quickly spreading 
through Latin America, may be respon- 
sible for a spike in severe birth defects. 
Transmitted by mosquitoes, Zika usually 
causes relatively mild symptoms, including 
fever and rashes. But the Brazilian govern- 
ment warned last week that the virus may 
be responsible for a dramatic increase in 
cases of microcephaly, a severe birth defect 
in which the brain fails to develop properly 
and the head is much smaller than normal. 
The connection is not yet proven, but 
several lines of evidence point toward Zika 
as a culprit: The virus has been found in
the amniotic fluid of two fetuses diagnosed 
with microcephaly via ultrasound, and has 
also been found in tissues of a baby with 
microcephaly that died shortly after birth. 
http://scim.ag/Zikamicrocephaly 

Second try at Venus finds success 

TOKYO | A spacecraft designed to study
Venus’s atmosphere that missed its 
target 5 years ago has apparently suc- 
ceeded in entering an orbit around the 
planet, according to the Japan Aerospace 
Exploration Agency (JAXA). An engine 
malfunction during its first rendezvous 
with Venus on 7 December 2010 sent 
the probe, named Akatsuki, on a 5-year,
10-orbit trip around the sun. Engineers
used the time to develop a scheme to
insert the craft into orbit using four small
attitude control thrusters. JAXA reported 
that the thruster firing went as planned 
this week and that Akatsuki “is now in 
good health” and apparently circling 
Venus. Mission controllers will confirm the 
trajectory of the probe in the coming days. 



NEWSMAKERS 

Integrity office gets new head 

The U.S. office that guards against fraud in 
federally funded biomedical research has a 
new chief. Kathy Partin, a basic neurosci- 
entist and administrator at Colorado State 
University (CSU), Fort Collins, will become 
director of the federal Office of Research 
Integrity (ORI) the week of 27 December. 
Partin will replace David Wright, who
left ORI in March 2014 after 2 years, having
become frustrated with the federal
bureaucracy and the little regard his office
was given by the Department of Health
and Human Services. ORI has often been
criticized for moving too slowly to close
cases and for meting out relatively light
punishments for those found guilty of
research misconduct. Partin studies glutamate
receptors in the brain, is assistant
vice president for research at CSU, and
directs the university’s research integrity
office. http://scim.ag/PartinORI

The secret to cuttlefish ‘invisibility’

When threatened by predators, cuttlefish (such as Sepia officinalis, below) stop
breathing and freeze in place. That isn’t just for visual camouflage, suggests a
study in the Proceedings of the Royal Society B. Sharks can detect the electrical
signals produced by an animal’s gills, and less water flowing over the
cuttlefish’s gills makes it harder to detect, it turns out. In the lab, researchers
measured electrical signals generated by cuttlefish: first, at rest on the floor of a
tank, and then startled by videos of a looming predator. When the frightened creatures
froze and covered the cavity leading to the gills with their tentacles, the voltage
in the water dropped by about 80%. The tactic works beautifully on sharks, the team
found: They simulated electrical signals from a fleeing cuttlefish, a cuttlefish at rest,
and a cuttlefish holding its breath. The sharks detected the fleeing cuttlefish 94% of
the time and from up to 38 centimeters away; they sensed the resting cuttlefish from
20 centimeters away and struck 62% of the time, but they had to get within 15 centimeters
of the frozen cuttlefish to detect it, and still only struck 30% of the time.

Three Q’s 

The new open-access scientific journal
Matters aims to publish single observations
rather than complete stories.
Science spoke with the journal’s founder,
Lawrence Rajendran, a cell biologist at
the University of Zurich in Switzerland.
http://scim.ag/Mattersjournal

Q: What motivated you to launch this 
new journal? 

A: As a postdoc applying for jobs 8 years 
ago, I was shocked to discover that certain 
figures from retracted manuscripts had 
gotten published elsewhere. It made me 
wonder if the pressure to publish “good 
stories” could be nudging authors to tweak 
data that doesn’t neatly fit a storyline. I 
remember thinking: If people could just 
report the one thing they’re comfortable 
with, they wouldn’t have to be dishonest. 

Q: Tell us about Matters’ submission and 
review process. 

A: We’re focusing on cell biology, biochemistry,
biophysics, neuroscience, and genetics.
The first 500 submissions to the journal
will be free; after that, Matters will charge
$150 per submission from universities and
other nonprofits, and $300 per submission
from for-profit entities. We use a triple-blind
peer-review process and will publish
all scientifically sound observations, even
negative data and confirmatory data,
within 2 weeks of submission.

Q: But there’s a good reason many journals
publish “stories”: people like stories.

A: An initial finding can be extended with
subsequent observations by the original authors
or others, who can then build narratives
from the collection of findings. It will
be like Twitter: you can always go back to
who said it first. Our tagline is not “stories
don’t matter”; it’s “stories can wait.”

Pluto mission reveals best images yet

NASA’s New Horizons spacecraft has returned the best pictures of Pluto the world
may ever see. The new images, at resolutions of about 80 meters per pixel, show
a striking shoreline, where smooth plains of nitrogen ice from Pluto’s “heart” rub
up against water ice mountains several kilometers high. The jumbled-up rubble
at the base of the ice mountains helps confirm team members’ theories that the
mountains are, in fact, giant icebergs that have moved around on more plastic layers
of nitrogen ice below. The spacecraft made its closest approach to the dwarf planet
in July. But because of the great distances and the spacecraft’s low-power antenna,
some of the best data are reaching Earth only now.



BY THE NUMBERS 

55 

Percent of giant exoplanets spot- 
ted by NASA’s Kepler mission that 
are false positives— not planets but 
another object such as a brown 
dwarf, said scientists at the Extreme 
Solar Systems III conference last 
week in Hawaii. 




million

Amount, in U.S. dollars, of Australia’s
new National Innovation and Science
Agenda, announced 7 December to
help reverse deep cuts to science
made by the previous administration.
http://scim.ag/Ausboom

€5 billion

Total amount of investment in R&D
per year, by both public and private
sources, that the Irish government
aims for by 2020. Last year, the total
spent was €2.9 billion.
GENETIC ENGINEERING

Germline editing dominates DNA summit 

Prospect of heritable changes sparks questions about safety, ethics, and rationale 



By John Travis 

A congressman’s warning. The long
shadow of eugenics. Philosophi- 
cal, ethical, and moral debates. 
Uncertain science and patchy regu- 
lations. And a cast that included 
scientists expected to share a Nobel 
Prize for a new DNA-changing technology, 
commonly called CRISPR. All was on dis- 
play last week at the International Summit 
on Human Gene Editing, held at the U.S. 
National Academy of Sciences (NAS) in 
Washington, D.C. 

“CRISPR and related technologies have 
the potential to revolutionize the treat- 
ment of diseases but could be used in 
many ways not beneficial to society,” noted 
Representative Bill Foster (D-IL), the only 
Ph.D. physicist in the U.S. Congress, as the 
summit opened. 

The gathering ended on a similar note:
The organizing committee strongly endorsed
the use of CRISPR and similar methods
for basic research that involves altering
DNA of human sperm, eggs, and embryos,
but concluded that producing a pregnancy
from such modified cells or embryos is currently
“irresponsible” because of ongoing
safety concerns and a lack of societal consensus.
Yet the group (12 biologists, physicians,
and bioethicists) did not flatly rule
out future use of such “germline editing,”
in which the DNA changes would be passed
down from one generation to the next.

Much of the summit’s discussion revolved
around whether germline modifications
could be justified to prevent the
inheritance of disease or, more contro- 
versially, to produce “enhancements” such 
as higher IQ or a reduced need for sleep. In 
fact, the organizers called for an “ongoing 
international forum” to regularly revisit the 
issue of germline editing. “The unthinkable 
has become conceivable,” declared biologist 
David Baltimore of the California Institute 
of Technology in Pasadena, the chairperson 
of the organizing committee. 

The new molecular toolkit motivating the 
summit, CRISPR, consists of a DNA-cutting 
enzyme called a nuclease and a piece of RNA
that homes in on a DNA sequence, enabling
researchers to create precisely targeted
mutations, corrections to mutations,
or other alterations. Along with two earlier 
genome editors, zinc finger nucleases and 
TALENs, CRISPR is transforming basic 
biology and aiding the development of new 
crops and farm animals. Earlier this year, a 
Chinese team became the first to publicly 
report using CRISPR to alter the DNA of 
human embryos— nonviable ones— from in 
vitro fertilization (IVF) clinics, and a U.K. 
group said it wanted to do similar research 
in the United Kingdom. In response, NAS,
the U.S. National Academy of Medicine,
the United Kingdom’s Royal Society, and 
the Chinese Academy of Sciences rushed to 
convene the summit. 

By tweaking CRISPR’s RNA or nuclease 
components, researchers expect to make 
the tool, developed from an immune sys- 
tem found in bacteria, still more precise. 
And “there are likely more powerful sys- 
tems still out there in nature,” noted Feng 
Zhang of the Broad Institute in Cambridge, 
Massachusetts, who at one point shared the 
stage with other CRISPR pioneers, Jennifer 
Doudna of the University of California, Berkeley,
and Emmanuelle Charpentier of the
Max Planck Institute for Infection Biology 
in Berlin. Many predict the trio will win a 
Nobel for developing the system. 

Few at the summit had qualms about us- 
ing CRISPR and its rivals in somatic cells to 
treat disease. By applying zinc finger nucle- 
ases to immune cells in vitro, groups have 
already deactivated the gene for CCR5, a pro- 
tein that the AIDS virus uses to invade the 
cells; the modified cells were then returned 
to HIV-infected people. And last month, a 
team reported on using TALENs to edit can- 
cer-fighting immune cells that subsequently 
arrested leukemia in a 1-year-old girl. Speak- 
ers also discussed plans to use CRISPR on 
blood stem cells to reverse disorders such as 
sickle cell anemia and beta thalassemia by 
fixing hemoglobin gene mutations. 

Biologist Janet Rossant of The Hospi- 
tal for Sick Children in Toronto, Canada, 
among others, spoke of the need to go further
and use gene editing to do basic research
on human embryos, work that the
U.S. National Institutes of Health is currently
prevented by law from funding and
that could get scientists imprisoned in Germany.
Such studies, she explained, could
probe the causes of miscarriages, birth 
defects, and developmental disorders. “We 
still have a lot more to learn about the hu- 
man embryo and preimplantation stages,” 
Rossant said. Attendees largely agreed, but 
differed about whether spare IVF embryos 
are good enough for such work, or whether 
the door should be open to creating em- 
bryos just for research. 

Germline gene editing to produce babies 
stirred deeper debate. The summit heard a 
primer on the dark history of eugenics, but 
John Harris, a philosopher at the University 
of Manchester in the United Kingdom, later 
argued that nothing is sacred about the 
germline. All forms of assisted reproduc- 
tion affect future generations, he said, and 
normal reproduction is 
a “genetic lottery” that 
often produces birth de- 
fects and disease. 

For now, the best ar- 
gument for germline 
editing may be its po- 
tential to correct known 
genetic errors and al- 
low parents to have 
healthy children. That 
did not sway Catholic 
theologian Hille Haker 
of Loyola University 
Chicago in Illinois, who 
contended that no one 
has a right to have ge- 
netically related chil- 
dren and called for a 
ban on all human germ- 
line editing research. 

Also stirring discus- 
sion was the question 
of whether germline 
editing is actually the best way to prevent 
the transmission of genetic diseases. Many 
cited a potential alternative: preimplanta- 
tion genetic diagnosis (PGD), a relatively 
new procedure in which cells removed from 
IVF embryos are screened for inherited mu- 
tations; only healthy embryos are then im- 
planted. In most cases in which one or both 
parents have a known inheritable disease, 
Mendelian genetics imply that some fraction 
of their embryos will be free of the respon- 
sible mutation(s). With PGD, those healthy 
embryos could be identified and implanted. 
Baltimore crystallized the issue, asking: “Is it 
more ethical to edit embryos or screen a lot 
of embryos and throw many away?” 

In some cases, however, no IVF embryos 
would be normal, rendering PGD useless. 
For example, if both parents have cystic fibrosis,
an autosomal recessive disorder, any
offspring would carry double mutations. 
Similarly, a person with two dominant mu- 
tations, such as those causing Huntington 
disease, would pass on one. Speakers raised 
other scenarios that might call for germline 
editing, including bids to reverse infertility 
and parents seeking a “savior sibling” to 
provide matched, healthy bone marrow for 
a sick older child. 
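
The Mendelian reasoning in these scenarios can be checked with a small Punnett-square calculation. The sketch below is illustrative only and assumes a single gene with two alleles, written as hypothetical one-letter labels ("A"/"a" for a recessive mutation, "H"/"h" for a dominant one); real inheritance and real PGD screening are more complicated.

```python
from fractions import Fraction
from itertools import product


def offspring_genotypes(parent_a: str, parent_b: str) -> dict:
    """Map each possible offspring genotype to its probability.

    Each parent is a two-letter string of alleles for one gene, for
    example "Aa". One allele is drawn from each parent with equal chance.
    """
    outcomes = {}
    for allele_a, allele_b in product(parent_a, parent_b):
        # Sort so that "aA" and "Aa" count as the same genotype.
        genotype = "".join(sorted(allele_a + allele_b))
        outcomes[genotype] = outcomes.get(genotype, Fraction(0)) + Fraction(1, 4)
    return outcomes


def fraction_without(genotypes: dict, mutant: str) -> Fraction:
    """Probability that an embryo carries no copy of the mutant allele."""
    return sum((p for g, p in genotypes.items() if mutant not in g), Fraction(0))


# Two carriers of a recessive mutation "a": PGD has embryos to choose from.
carriers = offspring_genotypes("Aa", "Aa")
# Both parents affected by a recessive disorder (the cystic fibrosis case).
both_affected = offspring_genotypes("aa", "aa")
# One parent with two copies of a dominant mutation "H" (the Huntington case).
two_dominant = offspring_genotypes("HH", "hh")

print(fraction_without(carriers, "a"))       # 1/4
print(fraction_without(both_affected, "a"))  # 0
print(fraction_without(two_dominant, "H"))   # 0
```

In the first case a quarter of embryos, on average, carry no copy of the mutation, so screening has something to select. In the other two cases that fraction is zero, which is why PGD alone cannot help there.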

For now, few would say genome editing 
is safe enough for such uses. In the human 
embryo experiment, CRISPR cut many non- 
targeted genes. And Rudolf Jaenisch of the 
Massachusetts Institute of Technology in 
Cambridge noted that when CRISPR re- 
pairs one copy of a disease gene, it some- 
times mutates the healthy copy. Eric Lander 
of the Broad Institute pointed out that if 
editing is attempted in early-stage human 
embryos, rather than in sperm or eggs, it 
might only reach some 
cells, not all, resulting 
in a mosaic embryo with 
some mutant tissues. 

Technology may not 
be a showstopper, however.
Zhang and others
at the summit were op- 
timistic about reducing 
CRISPR’s worrisome off- 
target edits. But Lander, 
who was on the summit’s 
organizing committee, 
reminded attendees 
that the issues go well 
beyond technical skill, 
especially when it comes 
to genetic enhancement. 
We know far too little 
about the human ge- 
nome’s role in cognition 
and other traits to try 
to mess with it, he said. 
“The conclusion is sim- 
ply humility. Before we make permanent 
changes to the human gene pool, we should 
exercise considerable caution.” 

Even if some parents and clinicians even- 
tually decide it’s safe and sensible to do 
germline engineering, they face a pastiche 
of conflicting laws and regulations around 
the world. More than 40 countries outlaw 
tampering with the germline in IVF efforts, 
although many others, including the United 
States, do not. The U.S. government will not 
fund clinical trials of such work, but gives 
free rein to private IVF clinics. Near the end 
of the summit, Barbara Evans of the Univer- 
sity of Houston Law Center in Texas noted 
that “the ‘science’ of regulation is more pre- 
carious and uncertain than the science of 
gene editing.” ■ 



MARINE MAMMALS 

A whale’s life, 
inscribed in 
baleen 

Fibrous keratin can store 
a chemical record of 
pregnancies and stress 

By Rebecca Kessler 

In 2004, a collision with a ship cut short
the life of an endangered North Atlantic 
right whale. Many lamented the death 
of Stumpy, as scientists called her— all 
the more because she was pregnant. But 
Stumpy’s demise could ultimately help 
other whales. Researchers are using plates 
of her tough, hairlike baleen to develop a 
new method of reconstructing a whale’s life 
story— much as scientists use growth rings 
to reveal a tree’s past. The nascent tech- 
nique could make it easier to study, and 
perhaps protect, whales. 

Deriving whale biographies from baleen 
is just one concept to be discussed next 
week at the 21st Biennial Conference on the 
Biology of Marine Mammals in San Fran- 
cisco, California. Some 3000 researchers 
will share insights into some of the world’s 
most charismatic creatures, including first- 
time audio recordings of rare species and 
updates on struggling populations. They’ll 
also discuss novel ways of assessing the 
health and physiology of free-swimming 
mammals that can be hard to study in 
the wild. 

That’s where the work involving 
Stumpy— and a second right whale named 
Staccato— will come in. Over the past de- 
cade or so, wildlife researchers have been 
figuring out how to pry life history informa- 
tion out of tissues based on keratin, the fi- 
brous protein found in nails, hair, feathers, 
horns, claws, hooves, and skin. Researchers 
have used hair, for example, to study past 
drug use in humans and exposure to toxins 
in wildlife. 

human embryos (shown here) offers an
alternative to editing their DNA.

Hormones stored in the baleen
of this bowhead whale, killed by
Native Alaskan hunters, can help
tell its life story.

Keratin is also a major component of baleen,
the comblike structures on the upper
jaws of about 14 whale species that help
the animals catch food. The tips of baleen
plates (the comb’s teeth) wear away over
time, so whales continuously grow new
keratin to replace what’s lost. The new baleen
stores chemicals, such as hormones or
contaminants, which are circulating in the
whale’s blood at the time. That makes each
plate— which can represent years or de- 
cades of growth— a time capsule, recording 
important life events such as pregnancies 
or exposure to stressful situations. 

Such information can be crucial to con- 
serving rare species such as North Atlantic 
right whales, which number just about 500. 

When it comes to large whales, “there’s no 
other animal on the planet where we know 
[so] little about basic reproduction,” says 
Kathleen Hunt, a research scientist at the 
New England Aquarium in Boston. For 
example, scientists don’t know how long 
female right whales carry their calves, or 
how frequently they become fertile. They 
also can’t easily tell whether the whales are 
experiencing harmful stress resulting from 
ocean noise, pollution, or other factors. 

Hunt and Rosalind Rolland, a colleague 
at the aquarium, have already developed 
noninvasive ways to measure whale hor- 
mones associated with 
stress and reproduction. 

They involve going to sea 
to scoop up fresh dung or 
to collect “whale blow”— the 
air and droplets an animal 
exhales through its blow- 
hole. But such samples 
provide only a snapshot 
of a whale’s current physi- 
ological state. Hormone 
measurements from baleen 
promised a fuller backstory. 

Last year, the researchers published their first
results from a pilot study of
16 bowhead whales killed as
part of a legal hunt by native
Alaskans. Bowheads are
a long-lived species, and their baleen can
record about 25 years of growth. In Conservation
Physiology, a team led by Hunt and
Rolland reported that not only could they
detect two hormones in baleen (the stress
hormone cortisol and the reproductive
hormone progesterone), they could also use
them to distinguish males from females,
and identify pregnant females. They also
saw progesterone spikes that looked suspiciously
like signatures of past pregnancies.

Baleen from Stumpy, a right whale killed by a ship, yielded a record of her pregnancies.

To know whether those signatures were 
real, the researchers needed baleen results 
they could match against documented 
whale pregnancies. Enter Stumpy and 
Staccato, also killed by a ship, in 1999. As 
a result of efforts by government agencies 
and conservation groups to monitor and 
protect right whales, scientists had records 
indicating when each whale had given birth 
to some of her calves. 

In San Francisco, Hunt and her colleagues
will unveil new
results that suggest re- 
searchers can read baleen 
like a history book. When 
they analyzed plates from 
Stumpy and Staccato re- 
cording about 10 years of 
growth, they found that 
progesterone levels spiked 
in material dating to docu- 
mented pregnancies, and 
then dropped for several 
years while the mothers 
were known to be raising 
calves and later traveling 
solo. The pattern was “in- 
credibly striking and obvi- 
ous,” Hunt says. That was a 
huge contrast to the “very 
subtle patterns” she’s used 
to seeing in other kinds of 
whale data, she says. “Yeah! 
Most exciting results I’d 
ever seen in my career.” 

The team has also been able to measure 
cortisol in the baleen, and they see patterns 
suggesting that stressors, such as getting 
entangled in commercial fishing gear, can 
interfere with a whale’s ability to get preg- 
nant. Stumpy’s baleen, for instance, showed 
elevated cortisol levels after one pregnancy, 
and she didn’t get pregnant again until the 
cortisol levels dropped. 

Assuming the technique works in other 
baleen whales, researchers say it would be 
a welcome new tool. Currently, the only 
other way to reconstruct a whale’s past is by 
analyzing the layers of wax that accrete in 
some whale ears. Both the wax and baleen 
can be collected only from dead animals, 
but Hunt notes that the wax doesn’t last 
long and is rarely collected. In contrast, ba- 
leen is sturdy and long lasting. That could 
enable researchers to see far back in time 
by analyzing museum specimens of baleen, 
some of which are more 
than a century old, or whale 
remains found at archaeo- 
logical sites. 

Such “specimens are a 
treasure trove of biological 
information just waiting 
to be unlocked,” says eco- 
toxicologist Thea Bechshoft, 
a postdoctoral researcher 
at the University of Alberta, 
Edmonton, in Canada. The 
baleen work, she says, “adds 
to the growing evidence of 
how amazingly useful kera- 
tinous tissue ... can be.” ■ 



Rebecca Kessler is a jour- 
nalist in Providence. 
WEATHER FORECASTING

CubeSats promise to fill weather data gap 

Companies bet on a GPS technique and an agency shift to commercial data 



By Eric Hand 

The massive weather satellites that
plumb the atmosphere are facing a 
swarm of tiny competitors. Data from 
shoebox-sized private satellites, ex- 
ploiting a new technique for probing 
the atmosphere, could significantly 
reduce forecast errors, researchers say. But
national weather agencies, used to generat- 
ing their own data, have so far been reluc- 
tant customers. At a workshop this week, 
the U.S. National Oceanic and Atmospheric 
Administration (NOAA) hoped to address 
some of the stumbling blocks. 

The big government systems are becom- 
ing increasingly fragile 
and unaffordable. Witness 
the $11.3 billion Joint Po- 
lar Satellite System (JPSS) 
at NOAA: It is drastically 
over budget and behind 
schedule for the planned 
2017 launch of its JPSS-1 
satellite, jeopardizing the flow of crucial 
weather data. 

Companies say they can fill the gap with 
data from cheap CubeSats, small satellites 
the size of one or several 10-centimeter 
cubes (Science, 10 April, p. 172). Last week,
Boulder, Colorado-based PlanetiQ booked 
a 2016 flight for its first two weather satel- 
lites. And after the successful launch of four 
satellites in September, San Francisco, Cali- 
fornia-based Spire says it is in negotiations 
to license its data to both government and 
private entities. Yet NOAA has been hesitant
to support the satellite weather startups,
says Mariel Borowitz, a space policy
researcher at the Georgia Institute of Tech- 
nology in Atlanta. “They have said, ‘Build it 
and once it’s in operation, then we’ll decide 
if we want it or not.’” 

One reason is NOAA’s concern that the 
private companies would restrict its abil- 
ity to share data freely with other agencies 
and governments. This week’s workshop 
built on a draft commercial space policy the 
agency released in September, which sug- 
gests that data buys should comply with a 
guiding principle of open and free data. 

The companies are taking advantage of 
one of the most promising remote sens- 
ing techniques of the last 20 years: GPS 
radio occultation. Dozens of GPS satellites
circle the globe in medium-Earth orbit, 
constantly broadcasting the radio signals 
used to locate everything from cellphones 
to drones. When the signals pass through 
Earth’s atmosphere, they bend slightly, like 
light refracting in water. A satellite in low- 
Earth orbit can pick up signals that have 
skimmed the atmosphere, calculate the 
bending from their travel time, and use it 
to infer temperature, pressure, and humid- 
ity at particular altitudes. A constellation of 
small satellites could measure thousands of 
these vertical profiles each day as GPS satel- 
lites constantly rise and set on the horizon. 
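
The inference step described above rests on the fact that the bending of a radio signal depends on the atmosphere's refractivity, which in turn depends on pressure, temperature, and water vapor. The sketch below is a minimal illustration, assuming the widely used Smith-Weintraub approximation; that formula, the function names, and the input values are assumptions brought in for illustration and are not taken from this article. An operational retrieval is more involved, since it must first convert signal timing into bending angles and then into refractivity.

```python
def refractivity(pressure_hpa, temperature_k, vapor_pressure_hpa=0.0):
    """Approximate radio refractivity N, in dimensionless N-units.

    Assumed formula (Smith-Weintraub form): N = 77.6*P/T + 3.73e5*e/T**2,
    with total pressure P and water vapor pressure e in hPa, and T in kelvin.
    """
    dry_term = 77.6 * pressure_hpa / temperature_k
    wet_term = 3.73e5 * vapor_pressure_hpa / temperature_k ** 2
    return dry_term + wet_term


def refractive_index(n_units):
    """Convert refractivity in N-units to the refractive index n."""
    return 1.0 + n_units * 1e-6


# Illustrative (hypothetical) conditions: humid air near the surface
# and cold, dry air at high altitude.
surface = refractivity(1013.25, 288.15, vapor_pressure_hpa=10.0)
aloft = refractivity(250.0, 220.0)

print(f"surface N = {surface:.1f}, index = {refractive_index(surface):.6f}")
print(f"aloft   N = {aloft:.1f}, index = {refractive_index(aloft):.6f}")
```

Refractivity falls with altitude, so a signal that skims the lower atmosphere bends more than one passing higher up. In dry air only the first term matters, which is why a measured refractivity profile can be turned into temperature and pressure; in moist air, separating the humidity contribution needs additional information.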

Weather balloons offer similar data, but
they are released mostly over land in the 
Northern Hemisphere. Radio occultations, 
by contrast, occur day and night and are 
evenly distributed around the globe. Oc- 
cultations also offer advantages over the 
infrared and microwave data collected by 
massive satellites like JPSS. Infrared sound- 
ers cannot see through clouds, and micro- 
wave sounders are confused by moisture, 
which adds uncertainty to predictions of 
hurricane paths and intensities. 

What’s more, because occultation mea- 
surements depend only on the timing of an 
arriving GPS signal, it is easy to compare 
data from different satellites and cam- 
paigns without complex corrections. Once 
a few decades of data are 
in place, radio occultations 
could even track global 
warming trends. “You don’t 
need to bias-correct the 
system,” says Sean Healy, 
a scientist at the European
Centre for Medium-Range
Weather Forecasts (ECMWF) in Reading,
U.K. “That’s why it should be useful for cli- 
mate monitoring.” 

As countries expand their GPS networks 
(China and India are rapidly building their 
own systems), the potential for radio- 
occultation data grows. Healy says the EC- 
MWF currently primes its forecasts with as 
many as 3000 daily radio occultation pro- 
files, gathered by a handful of satellites— 
but the agency wants more. A 2013 study 
found that including 16,000 daily profiles 
in the ECMWF model could significantly re- 
duce forecast errors, and that benefits con- 
tinued to accrue with as many as 128,000. 

So far, the data have come from govern- 
ment satellites, but the startup companies 
see an opportunity. “GPS radio occultation 
is the best candidate for attempting com- 
mercialization of a weather satellite sys- 
tem,” Borowitz says. 

Some want the agency-owned approach 
to continue. A public, $100 million system 
of six satellites called COSMIC, launched 
in 2006, pioneered the technique, and its 
leaders have secured $450 million from Tai-
wan, the U.S. Air Force, and NOAA for COS-
MIC-2, a new system of 12 satellites. Rick
Anthes, who helped develop COSMIC when
he was president of the University Corpora-
tion for Atmospheric Research in Boulder,
thinks the public approach is the better
bet. “There are definite plans and definite
instruments and definite data quality,” he
says. He notes that COSMIC-2 satellites, at
70 kilograms, are bigger than the private
companies’ CubeSats. A bigger satellite can
carry a bigger antenna and better receivers,
which boosts signal quality, he says.

But Spire, with $80 million in investor
funding and a nimble, Silicon Valley ap-
proach, believes that a bigger constella-
tion can overcome any limitations in data
quality. In September, four of its “Lemur”
satellites (each a 5-kilogram “3U” box tak-
ing up three CubeSats’ worth of space)
reached orbit on an Indian rocket. “All are
nominal, fully functioning, and delivering
data to us,” says founder and CEO Peter
Platzer. He would not say how many pro-
files the Lemurs are producing, only that
four to eight additional Lemurs are likely
to be launched every month for the next 2
years, and that the company is on track to
produce 20,000 profiles a day by the end of
2016. He says more than half of the world’s
10 largest weather-forecasting agencies
have sent Spire letters of intent for data
licensing agreements.

With a 10-kilogram, 6U CubeSat de-
sign, PlanetiQ’s satellites are twice the size
of Spire’s, a difference that CEO Chris
McCormack says makes possible better data
quality. His bigger satellites can tune in to
multiple GPS networks, whereas Spire’s
first Lemurs listen only to the U.S. system.
With $5 million in investor funding, Plan-
etiQ says that by the end of 2017 it will be
operating a constellation of 12 satellites ca-
pable of generating 34,000 profiles a day.

McCormack is palpably frustrated with
NOAA’s reluctance to encourage commer-
cial data. “They have not been a supporter
of data buys whatsoever,” he says. “They
don’t like change.” The agencies so far are
insisting on freely shareable commercial
data. Platzer says he would be happy to of-
fer such a licensing agreement, but that
it would be much more expensive. The
companies hope to profit by selling data
to multiple buyers, both private and pub-
lic. Borowitz says, however, that the global
data-sharing protocols specify that all “es-
sential” data be shared freely. The word “essential”
could offer agencies some interpretive wig-
gle room.

NOAA declined to make officials avail-
able for an interview. In a statement,
spokesperson John Leslie said the agency
is reviewing comments in response to
its draft commercial space policy. But
Platzer is confident that agreements can
be reached. “The global weather commu-
nity can use every single GPS radio occul-
tation profile we can get our hands on,”
he says. “Every single one will save lives
and money.” ■

Bent over the horizon

By picking up GPS signals passing through the atmosphere, small GPS radio occultation (GPS-RO) satellites can
gather weather data.

“Every single GPS radio occultation profile we can
get our hands on ... will save lives and money.”

Peter Platzer, CEO, Spire

PHYSICS

Controversial test finds no
sign of a holographic universe

Speculative effort was destined to fail, critics say

By Adrian Cho

Even proponents admitted the experi-
ment was a long shot. Working with 
a few lasers and mirrors, physicists 
at Fermi National Accelerator Labo- 
ratory (Fermilab) in Batavia, Illinois, 
claimed they could test perhaps the 
wildest idea from string theory: that our 
universe may be like an enormous holo-
gram (Science, 13 April 2012, p. 147). Oth-
ers scoffed that the experiment couldn’t
possibly probe the “holographic principle.” 



So last week, when experimenters with the 
$2.5 million Fermilab Holometer reported 
that they had seen no evidence to support 
the principle, critics were unimpressed. 

“Not surprising, as the idea underlying it 
is nonsense,” tweeted Sabine Hossenfelder, 
a theorist at the Nordic Institute for Theo- 
retical Physics in Stockholm. However, 
Yanbei Chen, a theorist at the California 
Institute of Technology in Pasadena, says 
the experiment and its inventor, Fermilab 
theorist Craig Hogan, deserve credit for 
trying. “At least he’s making some effort 
to make an experimental test,” Chen says. 
“If the string theorists complain that this 
is not testing what they’re doing, well, they 
can come up with their own tests.” 

The holographic principle springs from 
the theory of black holes, spherical regions 
where gravity is so intense that not even 
light can escape. In theory, a black hole has 
an amount of disorder, or entropy, that is
proportional to its surface area. As entropy 
is related to information content, some the- 
orists have conjectured that the maximum 
amount of information in any volume of 
space and time, or spacetime, may be pro- 
portional to its surface area. The universe 
would then work like a hologram, in
which a 2D pattern captures a 3D image. 
The idea could guide string theorists in 
their quest to meld the theories of gravity 
and quantum mechanics. 

In 2009 Hogan dreamt up a way to test 
the idea. The holographic principle might 
come about, he argued, 
if coordinates in differ- 
ent directions— up-down, 
forward-backward, right- 
left— obey a quantum 
mechanical uncertainty 
relationship like the 
Heisenberg uncertainty 
principle, which says it is 
impossible to know both 
the position and momen- 
tum of a particle such 
as an electron. It would 
then be impossible to de- 
fine position, at least on 
scales of 10⁻³⁵ meters.

Hogan figured he could 
spot the effect using L- 
shaped optical devices 
known as interferometers, which serve 
as ultraprecise rulers. If position really 
is hazy, then “holographic noise” should 
cause the output of an interferometer to 
jiggle at millions of cycles per second, he 
claims. If two interferometers were nestled 
one inside the other to probe the same 
volume of spacetime, their holographic 
noise would show telltale correlations— if 
the noise existed. However, using two in- 
terferometers with 39-meter arms, Hogan, 
Fermilab experimenter Aaron Chou, and 
colleagues saw no such correlated noise. 

Critics say Hogan’s idea is half-baked 
and has nothing to do with the holographic 
principle. Hogan is undaunted. “For me, 
the big news is that we have a technique for 
measuring spacetime at this level,” he says. 
He plans to reconfigure the holometer to 
look for another possible sign of quantum 
gravity: an uncertainty in angular orienta- 
tion within spacetime. ■ 
The Fermilab Holometer was built on a shoestring in an abandoned tunnel.

ANTHROPOLOGY

Deadly attack on isolated tribe 

Isolated people recovering from conflict with settled 
neighbors in the Brazilian Amazon 



By Barbara Fraser 

This fall, some members of the settled
Matis tribe saw a group of naked 
strangers along the riverbank near 
their village of Tawaya in the Amazon 
rainforest of western Brazil. The Ma- 
tis, who first made contact with the 
wider world in the 1970s, have shotguns and 
wear Western-style clothes. But they knew 
from previous encounters who the strangers 
were: Members of the Korubo tribe, who live 
deep in the forest, follow a traditional way of 
life, and typically shun interactions with out- 
siders. The Matis took the 10 Korubo— five 
adults, a young man, and four children— 
back to their village and contacted FUNAI, 
the Brazilian government agency respon- 
sible for indigenous peoples. 

To FUNAI, this was a health emergency. 
Isolated people lack immunity to common 
diseases, so workers arrived 3 days later to 
provide medical care to that first group of 
Korubo and more who came to the village a 
week later. As they talked with the Korubo, 
FUNAI officials heard some disturbing news: 
Some tribespeople said they had survived 
an attack by members of the Matis tribe in 
December 2014. At least eight Korubo were 
killed in what was apparently revenge for 
the deaths of two Matis men, also last De- 
cember. The Matis alerted FUNAI after their 
people were killed, but the agency did not 
authorize contact. 



These revelations, made public only late 
last month by FUNAI, have rekindled a 
broader debate that divides experts inside 
and outside Brazil as isolated indigenous 
peoples increasingly emerge from the forest, 
displaced by mining, logging, and illegal ac-
tivities (Science, 5 June, pp. 1061, 1072, 1080).
After deliberate contacts in the last century 
often proved catastrophic, with disease and 
violence killing two-thirds of the members 
of some newly contacted groups, FUNAI 
resolved not to initiate contact. “The goal 
is not to keep them isolated,” says Carlos 
Travassos, who heads the agency’s office for 
people in isolation in Brasilia. “The goal is to 
protect them so they can decide if they want 
to establish contact or not.” 

But in this case, some say that FUNAI 
should have taken the initiative and con- 
tacted the group. Anthropologist Robert 
Walker of the University of Missouri, Colum- 
bia, says the tragic deaths of the Matis and 
the Korubo are “a good example of why the 
‘leave them alone’ strategy doesn’t work.” 
The events are the latest in a series of 
sometimes hostile encounters between the 
Matis and isolated Korubo. Their territories 
have long overlapped in what is now the 
Vale do Javari Indigenous Territory, home to 
16 isolated groups as well as about 4500 set- 
tled indigenous people, according to FUNAI. 
In 2011, a group of Matis established a village 
in an area known to be used by the Korubo. 
FUNAI workers encouraged the Matis to
avoid contact with the Korubo, who some-
times appeared along the river.

A Korubo man and boys made contact with officials in
the Brazilian Amazon this fall.

Then in December 2014, a group of iso- 
lated Korubo killed two Matis men with 
wooden clubs, apparently after some Korubo 
died of illnesses that they blamed on an en- 
counter with the Matis, Travassos says. Some 
of the Matis then requested permission to 
establish contact with the Korubo, accord- 
ing to a letter from the Matis Indigenous As- 
sociation to the Brazilian Attorney General’s 
Office. But FUNAI officials believed that the 
Korubo’s hostility indicated that they did not 
want contact. To prevent conflict, the agency 
instead convinced the Matis to relocate to 
Tawaya, Travassos says. He adds that FUNAI 
learned of the revenge killings of the Korubo 
and their original deaths from illness only 
after speaking with tribespeople this fall, 
and that details of the December encounters 
are still unclear. 

Health workers are continuing to pro- 
vide health care for the Korubo. Respiratory 
illnesses— which are the most common 
cause of death in cases of recent con- 
tact— were reported, according to Douglas 
Rodrigues, a physician and public health ex-
pert at the Federal University of São Paulo,
who is in contact with the FUNAI team on 
the ground. Officials moved the Korubo 
camp away from the village to help prevent 
any additional disease transmission, and the 
illnesses are now under control, Rodrigues 
says, adding that a Korubo baby was born. 
Medical care and long-term monitoring are 
crucial, he says, because it can take four or 
five generations for people to develop natu- 
ral resistance to diseases. 

But the Matis fear further conflict. On 
1 December, the Matis association issued 
a statement calling for a meeting with the 
president of FUNAI and for the agency to 
send two teams to the area. The statement 
says that Matis villagers saw more Korubo 
near Tawaya in early November, and that 
stronger action by FUNAI is needed to 
avoid more deaths. If attacked, “the Matis 
will retaliate,” the statement said. Travassos 
says that FUNAI takes this situation very 
seriously but that the agency’s limited re- 
sources must stretch to cover many other 
isolated groups. 

Antenor Vaz, a former FUNAI official, 
says the events show that FUNAI needs to 
improve on-the-ground monitoring and re- 
spond more readily when contact is likely. 
“Within the policy of protection, contact is
foreseen,” he says. “The government and so-
ciety should be prepared for it.” ■



Barbara Fraser is a freelance writer in Lima.

INFECTIOUS DISEASES

HIV/AIDS care for all, on a budget

Meeting promotes ways to cut clinic visits and tests without compromising treatment 



By Jon Cohen, in Harare 

Every 2 months, a Land Cruiser stuffed
with health care workers leaves Chi- 
damoyo Hospital here early in the 
morning and drives 90 minutes east 
across washboard roads to the farm- 
ing village of Nyamutora. Boxes of 
antiretroviral (ARV) drugs and medical 
records crowd the back of the vehicle. Sit- 
ting on pews at a Salvation Army church, 
nearly 150 HIV-infected people are waiting 
for the nurses and counselors. By the time 
the Land Cruiser leaves about 6 hours later, 
all will have received enough drugs to last 
until the team’s next visit, had their weight 
documented, and spoken 
with a nurse. Some will have 
had blood drawn for later 
analyses to test how they’re 
responding to treatment. 

This routine departs from 
the usual model of HIV 
treatment, in which patients 
receive ARVs at frequent 
clinic visits. Yet the 143 Nya- 
mutora villagers who have 
been given their ARVs every 
2 months through this 3-year 
outreach effort have done 
remarkably well, Benjamin 
Chimukangara, a molecular 
virologist at the Biomedical 
Research and Training In- 
stitute in Harare, reported 
here last week at the Interna- 
tional Conference on AIDS 
and STIs in Africa. Only two 
have died from AIDS— both began treatment 
when they were very ill— and all but four oth- 
ers, or 96%, have reduced their “viral load” to 
the level that the World Health Organization 
(WHO) uses to define treatment success:
less than 1000 virus particles per milliliter
of blood. “They’ve shown an unbelievable 
adherence and response,” said study leader 
David Katzenstein of Stanford University in 
Palo Alto, California. 
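
The 96% figure can be checked with a short calculation. This is a minimal sketch assuming the percentage is taken over all 143 villagers, with the two deaths and the four others counted as not having reached the WHO threshold; the article does not state the denominator, and the function name is hypothetical.

```python
def suppression_rate(total, deaths, unsuppressed_survivors):
    """Return (number suppressed, fraction of the whole cohort suppressed).

    Assumption: everyone who neither died nor failed to reach the
    threshold counts as suppressed, and the denominator is the full cohort.
    """
    suppressed = total - deaths - unsuppressed_survivors
    return suppressed, suppressed / total


suppressed, rate = suppression_rate(total=143, deaths=2, unsuppressed_survivors=4)
print(f"{suppressed} of 143 suppressed: {rate:.1%}")  # prints "137 of 143 suppressed: 95.8%"
```

Under that reading, 137 of 143 people are below the threshold, which rounds to the reported 96%. Taking only the 141 survivors as the denominator would give about 97% instead.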

The result is good news for efforts to 
lower the cost of HIV treatment, so that poor 
countries like Zimbabwe can come closer to 
achieving a new goal: universal treatment
for HIV. Until today, most governments have
reserved ARVs for people who have shown
at least some immune system damage from
HIV. But new evidence shows that immedi-
ate treatment staves off AIDS, and in Sep-
tember WHO called on countries to offer the
drugs to all HIV-infected people. There are 
an estimated 21 million untreated people, 
12 million of whom live in Africa. Without 
a bolus of new cash, which no one expects, 
governments will have to come up with 
ways to cut costs without compromising the 
quality of care. The Nyamutora project is 
precisely the sort of simple, innovative “al- 
ternative delivery model” that can help. “The 
paradigm should be services going to people, 
and today, most people are going to services,” 
Michel Sidibe, head of the Joint United Na- 
tions Programme on HIV/AIDS, told Science. 

Sidibe, who spoke at the meeting, noted 
that the world now spends $22 billion on
HIV/AIDS a year, and yet the cost of drugs is 
only $80 to $100 per person a year in poor 
countries. Most of the global expenses are 
for clinic visits and lab tests. “It’s difficult 
for me to think we’ll be able to change com- 
pletely the scale of our treatment programs 
by continuing the traditional high-cost ap- 
proach,” he said. A new buzzword, “differ- 
entiated care,” calls for providing intensive 
services only to those who are ill or failing 
on medication: Those doing well can cut 
clinic visits back to twice a year and receive 
up to a 6-month supply of ARVs. 

The Nyamutora project, conceived by 
the local Salvation Army church and Chi- 
damoyo Hospital, shows how efficient that 
approach can be. Before the project began, 
people would either have to find round-trip 
transportation (which often meant paying
$6, a hefty sum for most) or walk for an en-
tire day to receive a month or two’s supply of
medication. What the villagers call “Chidam- 
oyo day” also saves the hospital staff time 
and money: The village visit is an efficient 
way to see many patients in one fell swoop, 
and their main extra cost is $35 for gas. 

One reason patients can forgo frequent 
clinic visits is that routine monitoring of
CD4s, the white blood cells that HIV targets 
and destroys, is increasingly being seen 
as a pointless money sink. At the meeting,
WHO recommended “stopping routine CD4 
count testing” for people who are stable on 
ARVs and instead monitoring viral loads 
as rarely as once a year. Using “dried blood 
spot” technology, which 
requires a simple finger 
prick rather than collecting 
venous blood to monitor 
viral load, could save even 
more money. Zimbabwe 
alone could save millions 
of dollars each year by us- 
ing the dried blood spot 
strategy instead of routine 
CD4 measurements, ac- 
cording to a model pre- 
sented by Paul Revill of the 
University of York in the 
United Kingdom. 

Less skilled workers can 
often do the same quality 
work as better-paid profes- 
sionals, several speakers 
said. Pharmacists, for ex- 
ample, need not distribute 
ARVs. Doctors Without 
Borders trained community health workers 
in rural Malawi to perform viral load tests 
accurately. “We have to explore every sin- 
gle piece of how to get as many people on 
treatment right now with the resources we 
have,” Deborah Birx, head of the U.S. Presi- 
dent’s Emergency Plan for AIDS Relief, told 
Science. 

Birx urged countries to put the new rec- 
ommendations into effect “within weeks, 
not years.” She also sees other opportuni- 
ties for savings. “You could visualize in 
these remote areas drones carrying drugs 
and drones carrying dried blood spots,” she 
suggests. “There are a lot of different inno- 
vations we haven’t even attempted.” ■ 



Reporting for this story was supported by 
the Pulitzer Center on Crisis Reporting. 
HIV-infected people in a remote Zimbabwean village receive antiretroviral drugs at the local
church once every 2 months. 
MIND THE PHONE



Devices that monitor and coach 
users bring promise and peril to 
the science of behavior change 



Fitbit activity trackers come with
a “manifesto.” Over a photo of a 
fierce-eyed jogger, the company’s 
website proclaims: “Every moment 
matters and every bit makes a big 
impact. Because fitness is the sum 
of your life.” 

Roughly 20 million people have 
found that message compelling 
enough to order a Fitbit. Many more seek 
out other devices and smartphone apps de- 
signed to count their steps, their calories, or 
their hours of sleep; to help them quit smok- 
ing, drinking, or stressing; or to help manage 
chronic illness. The distillation of daily life 
into a motivational stream of stats has be- 
come a booming industry— the world of the 
quantified self. 

This life-tracking craze has produced 
something that many clinical researchers 
covet: a deluge of intimate data about in- 
dividuals’ moment-to-moment behavior in 
“the wild,” as researchers sometimes call the 
world outside the controlled environment of 
the lab or the clinic. “It’s sort of opening a 
window into parts of people’s lives we haven’t 
really had access to before,” says Ida Sim, a 
physician and informaticist at the University 
of California, San Francisco. 

Once, peeking through that window re- 
quired equipping subjects with elaborate mo- 
tion and heart rate monitors designed spe- 
cifically for research. But now that roughly 
two-thirds of U.S. adults own smartphones 
equipped with GPS systems, cameras, and 
light and motion sensors, “people are think- 
ing, ‘Oh, well maybe I could just get [data] off 
somebody’s phone while it’s in their purse,’”
Sim says. “It’s bringing in a whole new group 
of people who are asking new questions.” 






By Kelly Servick 
For example, researchers wonder whether
they can finally discover just how much 
exercise— and what kind— makes for a 
healthy heart, and what strategies help 
people stop smoking for good. For many 
researchers, the hope is that mobile devices 
will allow them to go beyond collecting 
data to influencing behavior on a massive 
scale. Through activities built into an app 
or strategically timed alerts and messages, 
researchers can attempt to monitor and 
modify the habits of thousands of people 
simultaneously. Major university health cen- 
ters and government funding agencies hope 
“mHealth” will finally make a dent in intrac- 
table public health problems, from obesity to 
tobacco use to depression. Sim, for example, 
collaborates on a 4-year, 11-university proj- 
ect funded by a $10.8 million grant from the 
U.S. National Institutes of Health (NIH), to 
design new analytical tools for interpreting 
mobile data and using them to combat dis- 
ease. The team is already developing mobile 
technologies to help people manage conges- 
tive heart failure and quit smoking. 

But harnessing the self-tracking trend 
to promote healthier behavior is far from 
a sure bet. The world of commercial self- 
improvement apps is “the world of the 
cowboys,” says clinical health psychologist 
Bonnie Spring of Northwestern University 
Feinberg School of Medicine in Chicago, 
Illinois. Commercial app designers “are re- 
ally unbothered by the kind of standards of 
evidence that we care about,” says Spring, 
who studies behavioral treatments for obe- 
sity and tobacco addiction and collaborates 
on the NIH project. Few commercial apps 
have actually been shown to help change us- 
ers’ behavior, improve their health, or even 
take accurate measurements. 

Researchers hoping to bring rigor to the 
Wild West of mobile sensors are still wad- 
ing through fundamental questions: Do the 
raw data from a phone or wearable device 
reliably measure behavior? Does getting 
feedback about their behavior really help 
people change it? And how do you keep the 
download-happy masses from quickly losing 
interest or ignoring your app? Sim says it’s 
hard not to let expectations about mobile 
health research soar beyond the evidence. “I 
think right now it’s still a lot of excitement, 
and a lot of hype.” 

IN THE 1960s, walking clubs in Japan ad- 
opted a new fad: a commercial pedometer 
called manpo-kei, literally meaning “10,000-
step meter.” Researchers were soon exploring
the health benefits of the handy-but-arbitrary 
goal of 10,000 steps per day. Today, it’s the de- 
fault goal on every new Fitbit. 

But scientists still don’t know whether it’s 
the right exercise goal, says Euan Ashley, a 
cardiologist at Stanford University in Palo
Alto, California. “Is it better to do vigorous 
exercise on weekends, or is it better to accu- 
mulate 10,000 steps a day? We don’t know,” 
he says. “It’s almost like we have some- 
thing more powerful than any drug that we 
have for cardiovascular disease— physical 
activity— but we don’t know how to dose it.” 
He believes that the answer could lie in mo- 
bile health data. 

Encouraging words

Many behavior-change programs designed for mobile
phones rely on motivational texts to engage users all
day. The U.K.-based txt2stop service tailors messages
about quitting smoking to a user’s responses.

Day4=Big day - cravings still
strong? Don’t worry tomorrow
will be easier! Keep your mind
& hands busy. Save this txt so u
can txt CRAVE to us at any time
during the programme.

txt2stop service

Cravings last less than 5 minutes
on average. To help distract
yourself, try sipping a drink slowly
until the craving is over.

txt2stop, Response to text “crave”

Don’t feel bad or guilty if you’ve
slipped. You’ve achieved a lot by
stopping for a while. Slip-ups can
be a normal part of the quitting
process. Keep going, you can do it!

txt2stop, Response to text “lapse”

Research that relates behavior to health
has often relied on crude surveys that ask
patients to remember and report what
they’ve been up to. “‘What did you do on
Monday? How many flights of stairs did you 
do on Tuesday?’ That’s literally how these 
studies are carried out,” Ashley says. “I can 
barely remember what I had for breakfast, 
never mind what I did last Wednesday.” 
Even big, successful, longitudinal studies 
like the famous 67-year-old Framingham 
Heart Study have relied on occasional sur- 
veys to spot correlations between behaviors 
and measures of health. 

Other studies have taken people out of 
“the wild” for stints of close observation. 
Participants in sleep research may spend
days or weeks in the lab, sometimes wired 
up with sensors or lying in magnetic reso- 
nance imaging scanners, for example. But 
the effort and cost of recruiting and com- 
pensating subjects makes large-scale stud- 
ies impossible. 

Mobile phones and wearable sensors of- 
fer a much cheaper way to get huge sample 
sizes ... if they measure what they say they 
measure. “If we’re going to do science with 
these devices, we really want to validate 
them ourselves,” says Ashley, who is in the 
middle of that unglamorous task. He has 
rounded up all the major commercial fit- 
ness trackers to see how they compare with 
clinical grade equipment on their measures 
of heart rate and calories burned. His pre- 
liminary finding, which matches other re- 
cent studies, is that devices tend to agree on 
heart rate, but calorie counts are “kind of all 
over the place.” 

Ashley is also experimenting with a new 
system for gathering health and activity in- 
formation from iPhone users. He’s one of 
more than a dozen investigators who have 
launched apps using ResearchKit, Apple’s 
open-source software platform for scientists, 
unveiled in March. His team’s app, called 
MyHeart Counts, pulls data from phone 
accelerometers, which track daily step 
counts and can record participants’ performance
on a 6-minute test of walking speed.
The researchers can then explore how those 
readings correlate with participant-reported 
cardiovascular risk factors, diet, and mood. 
In its first month, the app recruited 30,000 
participants, all of whom opted to share 
data through an informed consent form on 
their phones. By now more than 47,000 have 
signed up. 

Ashley is just beginning to analyze the 
data, but his group is already developing a 
new version of the app. It turns the phone 
from a monitor into a coach, nudging par- 
ticipants to do more exercise. 

Psychologists like Spring welcome such 
efforts to wrestle behavior-change strate- 
gies onto our tiny screens. They say that al- 
though pocket-sized counseling or coaching 
is unlikely to replace traditional in-person 
sessions, it might extend the reach of such 
interventions. “We know that the people 
we’re helping are in some ways those least 
in need— the ones who can afford to come, 
who have the time, who can pay for the 
parking,” she says. 

In one early attempt, Spring and her col- 
leagues designed an app that draws on prin- 
ciples of the Diabetes Prevention Program— 
a clinically tested curriculum that she calls 
“the most successful weight loss approach 
ever.” Central to that approach is getting par- 
ticipants to religiously count their fat and 
calorie intake and track their weight, Spring 
says, which can be a challenge. Hoping to
keep users engaged, her group’s app depicts
calorie and fat allowances in colorful meters
that fill up over the course of the day.

Healthy habits

MIT researchers tracked undergraduates’ lifestyles and well-being by asking them to report their activities on their phones over 30 days; the calmest 20% and most
stressed 20% of students, as sorted by an end-of-month evaluation, had some different habits.

Other researchers are testing apps meant 
to help recently abstinent smokers avoid re- 
lapse. A U.K.-based program called txt2stop 
simply sends tailored text messages, such 
as “Day4=Big day - cravings still strong? 
Don’t worry tomorrow will be easier! Keep 
your mind & hands busy.” Users can text the 
word “crave” at any time to get additional 
reinforcement, and “lapse” if they have 
smoked and need coaching through a slip-up. 

That approach depends on users to reach 
out or engage with their phones when they’re 
tempted, however. “When people need the 
most help, they are not the ones likely to 
ask for it,” says Santosh Kumar, a computer 
scientist at the University of Memphis in 
Tennessee who leads the big NIH-funded 
project known as Mobile Data to Knowledge 
(MD2K). Ideally, he says, an app would sense 
the user’s context— including the presence of 
potential temptations— to figure out when 
someone needs guidance and then provide a 
so-called “just-in-time intervention.” 

MD2K collaborators are working on one 
such system. It will infer stress— a known 



risk factor for a lapse in attempts to quit 
smoking— from heartbeat intervals in 
electrocardiogram data collected by a chest 
band. (Kumar notes that data from a smart- 
watch could also work, provided the watch’s 
heart monitor delivers reliable data.) The 
MD2K system will also detect when people 
smoke without their having to report it, 
by combining breathing patterns from the 
chest band with readings of arm motion 
gathered by a motion-sensing wristband. 
The hope is that stress-relieving exercises 
can be timed to moments when a person is 
most vulnerable to an urge or most recep- 
tive to encouragement. 

Another context-dependent smoking app, 
called Q Sense, is under development in the 
lab of Felix Naughton, a health psychologist 
at the University of Cambridge in the United 
Kingdom. The app first uses a phone’s GPS 
system to tune into a person’s habits and 
learn where they are most likely to smoke— 
for example, in the pub or outside the work- 
place. Once people start to quit, they’ll re- 
ceive tailored messages of encouragement 
when they breach a certain radius of these 
locations. Being in the workplace, for exam- 
ple, might trigger instructions for a stress- 
reduction technique. 
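
The radius check described above can be sketched in a few lines. This is only an illustration, not Q Sense's actual implementation: the function names, the 100-meter default radius, and the example coordinates are all hypothetical, and the haversine formula is simply one common way to estimate the distance between two GPS fixes.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def triggered_locations(position, risky_places, radius_m=100.0):
    """Return the names of learned smoking locations within radius_m of position."""
    lat, lon = position
    return [
        name
        for name, (plat, plon) in risky_places.items()
        if haversine_m(lat, lon, plat, plon) <= radius_m
    ]


# Hypothetical learned locations (name -> (latitude, longitude)).
risky_places = {"pub": (52.2053, 0.1218), "workplace": (52.2100, 0.1300)}
print(triggered_locations((52.2054, 0.1219), risky_places))
```

In this made-up example the user is a few meters from the pub and several hundred meters from the workplace, so only the pub would prompt a message.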



DESPITE THE FLURRY OF RESEARCH, the first generation of behavior-change
apps has a spotty report card. The U.K. 
antismoking app txt2stop showed some 
benefit in a randomized study of 5524 par- 
ticipants. It doubled the rate of successful 
quit attempts after 6 months— from about 
5% in the control group to about 10% of 
text recipients. Although that may sound 
like meager progress, it’s cost-effective for 
health systems: The service costs £16,120 
but gains about 18 life-years per 1000 en- 
rolled participants. 
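
A back-of-the-envelope check helps put those figures in perspective. The sketch below assumes, since the text leaves it implicit, that the £16,120 cost and the 18 life-years both refer to the same 1000 enrolled participants; the variable names are illustrative.

```python
# Figures quoted in the article; treating the cost as "per 1000 enrolled"
# is an assumption made for this illustration.
cost_gbp = 16_120
life_years_gained = 18
control_quit_rate = 0.05  # "about 5%" in the control group
text_quit_rate = 0.10     # "about 10%" of text recipients

cost_per_life_year = cost_gbp / life_years_gained
relative_quit_rate = text_quit_rate / control_quit_rate
extra_quitters_per_1000 = (text_quit_rate - control_quit_rate) * 1000

print(f"Cost per life-year gained: about £{cost_per_life_year:.0f}")
print(f"Relative quit rate: {relative_quit_rate:.1f}x")
print(f"Extra quitters per 1000 enrolled: about {extra_quitters_per_1000:.0f}")
```

Under that assumption the service works out to roughly £900 per life-year gained, which is why a jump from about 5% to about 10% can still count as good value.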

Spring’s weight loss app, meanwhile, in- 
spired an astounding level of self-tracking 
in its users. They entered their weight on 
more than 90% of days, she says. “I’ve never 
seen this. It was unimaginable.” But when 
she compared app users with people who 
tracked their weight and food intake with 
paper and pencil, the app seemed to provide 
no additional benefit in terms of pounds 
lost— both groups saw modest weight loss. 

Spring suspects that self-tracking makes 
users more careful with their diets, but only 
to a point. Perhaps participants maxed out 
the benefit they could get from seeing their 
own data, so the app provided no advantage. 
If the researchers want more clinical improvements,
Spring says, they’ll have to add
in some other approach. 

Other app studies have struggled to re- 
veal any long-term benefits at all. A recent 
meta-analysis of 14 mobile weight inter- 
ventions found an average weight loss of 
only about 1.4 kilograms compared with 
control groups. And a 2013 review of 21 ran- 
domized controlled trials of mobile inter- 
ventions for obesity, diabetes management, 
smoking, and other health challenges found 
that less than half led to improvements in a 
relevant measure of health. 

“Frankly, this is so new that I’m not sure 
that we know that it works— that it makes 
a difference,” says Arthur Stone, a behav- 
ioral scientist at the University of Southern 
California in Los Angeles, who co-authored 
the review. In the 1990s, Stone was an early 
pioneer of real-time health tracking, having 
developed a method known as “ecological 
momentary assessment” that encourages 
participants to log their activity and de- 
scribe their moods right when they experi- 
ence them. The goal was to give researchers 
a more detailed picture of subjects’ psycho- 
logical symptoms. But as smartphones take 
data gathering to its extreme, he finds him- 
self among the skeptics. “Do we need the 
incredible density of data that we seem to 
automatically want to go to?” he wonders. 
“A lot of times, we’re measuring things be- 
cause we can measure them, and we don’t 
know exactly why we’re measuring them.” 

The new generation of just-in-time inter- 
ventions faces other hurdles. In a recent fea- 
sibility study to learn how smokers would 
use his Q Sense app, Naughton found that 
about half the time, users didn’t open the 
app for more than 30 minutes after they re- 
ceived a notification. That means the inter- 
vention likely wasn’t reaching people at the 
intended moment. 

The question of how and when a phone 
should interrupt a person has become a 
field of study in itself. Computer scientist 
Veljko Pejovic at the University of Ljubljana 
and colleagues have tried to model users’ 
“interruptibility” by gathering their feed- 
back about messages and alerts at various 
points in the day. So far, his results can’t 
offer a generalizable strategy. “It’s very per- 
sonalized,” he says. People may engage with 
or ignore a message based on their location, 
the time of day, the kind of activity they’re 
involved in, and whether they’re starting or 
finishing a task. 

The MD2K team worries that users won’t
be able to focus on an alert when they need
it most: at times of stress. So in January
they’ll launch a new study of their system,
involving 75 smokers, which will be “micro-randomized.”
A given user will sometimes
receive an alert telling him or her to do a
stress management exercise at moments of
high stress; at other times, the alert will ar- 
rive when stress is deemed low. Combined 
with records of smoking from wearable sen- 
sors, the data might reveal which strategy 
has the greatest impact. 
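
The logic of such a design can be sketched roughly as follows. This is a simplified illustration rather than the MD2K protocol: it assumes each participant passes through a series of "decision points" already labeled as high or low stress, that an alert is sent at each one with a fixed probability, and that a sensor later records whether smoking followed. All names and the 50% probability are hypothetical.

```python
import random
from collections import defaultdict


def micro_randomize(stress_levels, p_alert=0.5, seed=0):
    """Randomize alert / no alert independently at every decision point.

    stress_levels: sequence of "high" / "low" labels, one per decision point.
    Returns a list of (stress_level, alert_sent) pairs.
    """
    rng = random.Random(seed)
    return [(level, rng.random() < p_alert) for level in stress_levels]


def smoking_rate_by_arm(assignments, smoked_after):
    """Fraction of decision points followed by smoking, per (stress, alert) arm."""
    counts = defaultdict(lambda: [0, 0])  # arm -> [times smoked, total points]
    for (level, alert), smoked in zip(assignments, smoked_after):
        counts[(level, alert)][0] += int(smoked)
        counts[(level, alert)][1] += 1
    return {arm: smoked / total for arm, (smoked, total) in counts.items()}


# Toy example with made-up outcomes.
assignments = [("high", True), ("high", True), ("low", False)]
print(smoking_rate_by_arm(assignments, [True, False, False]))
```

Because the same person is randomized many times, comparing smoking rates across the four arms (high or low stress, alert or no alert) hints at when an alert helps most.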

It may turn out that no one behavior- 
change strategy will work for everyone. The 
MD2K team, for example, plans to eventu- 
ally personalize the timing of alerts for 
each participant. As apps bring in richer 
data about each individual user, it’s becom- 
ing clear that “what predicts behavior in 
groups doesn’t necessarily predict behavior 
in individuals,” Naughton says. 

Rosalind Picard, a computer scientist at 
the Massachusetts Institute of Technology 
(MIT) in Cambridge, is developing highly 
personalized interventions that would be 
sensitive to users’ state of mind. 

Picard was motivated by a tragedy: In 
2013, she learned that a former graduate 
student had taken his own life. Her team 
started thinking about how wearable sen- 
sors could relieve stress and prevent 
depression. “It’s one thing to study all 
this,” she says. “It’s another to build 
it into a form that people can start 
changing their lives around.” With 
support from a memorial fund 
organized by her former student’s 
mother, Picard’s lab has begun 
studying work-related stress and 
strategies for relieving it. 

A first step, published this year, 
tracks a group of MIT undergrads 
over 30 days, collecting data from 
wearable wrist sensors on move- 
ment, skin conductivity, and tem- 
perature, as well as smartphone 
records of location, calls, and text 
messages. The team then related 
these measurements to self- 
ratings of stress, health, energy, 
alertness, and happiness (see 
graph, p. 1308). Initial insights 
were far from shocking: Spend- 
ing extra time outdoors and 
getting ample, consistent sleep 
were among the factors predic- 
tive of happiness, for example. 

But the technology Picard 
envisions down the road is 
more elaborate: a system 
that trains itself to fore- 
cast an oncoming anxiety 
attack or a bout of depres- 
sion, based on sensor- 
derived signals that are 
unique to an individual, 
and that alerts wearers 
when they might be in 
trouble. For example, if 
a person’s sensor and 
mobile data showed that she was sleeping
irregularly or using her phone late at night 
at times when she felt down, the system 
might automatically send a reassurance or 
suggest that she get more sleep. 

It remains to be seen whether such a 
system could be reliable, not to mention 
how government regulators would view 
it. “We’re kind of where weather fore- 
casting was 150 years ago. People looked 
at the farmer’s almanac, and then the city 
got wiped out that night by a storm and 
they didn’t see it coming,” Picard says. But 
she thinks the technology is now good 
enough for researchers to think about pre- 
dicting human behavior. “It’s not as good 
as weather forecasting yet, but it’s better 
than random.” ■ 
EVOLUTION

Dissecting 
diversity in the 
social brain 

Male prairie voles with poor 
memory skills are less 
faithful to their partners 

By Gene E. Robinson 

Soon after launching an ambitious 25-year
study of the population cycles of
rodents in the fields and prairies near 
the University of Illinois at Urbana- 
Champaign in 1971, Lowell Getz saw 
something strange. Adult male and female
prairie voles (Microtus ochrogaster) often
appeared in the same live trap, unlike the
meadow vole, whose sexes were more segre- 
gated. Moreover, the same pairs were often 
retrapped months later. Using radio track- 
ing, Getz and his co-workers found that most 
trapped pairs were long-term partners living 
together in underground nests and sharing 
common home ranges (1). So began almost 
50 years of pathbreaking research into the 
biology of prairie vole monogamy, in one of 
fewer than 5% of mammalian species with a 
monogamous lifestyle (2, 3). On page 1371 of 
this issue, Okhovat et al. (4) use the prairie 
vole model system to investigate individual 
differences in this social behavior. Differ- 
ences in social behavior are widely observed 
but poorly understood in most species. 

A hallmark of the prairie vole model is 
its strong multidisciplinarity, from the field 
to the lab, and perhaps eventually to the 
clinic (3). Soon after his initial findings, Getz



Carl R. Woese Institute for Genomic Biology, Department 
of Entomology, Neuroscience Program, University of 
Illinois at Urbana-Champaign, Urbana, IL 61801, USA. 
E-mail: generobi@illinois.edu 




1310 11 DECEMBER 2015 • VOL 350 ISSUE 6266 



sciencemag.org SCIENCE 



Published by AAAS 





teamed up with neurobiologist Sue Carter 
to show that prairie vole mating in the labo- 
ratory results in the kind of robust partner 
preference that would lead to long-term pair- 
bonding in nature (2). Neurobiologists began 
to use prairie voles to study the role of the 
neuropeptide oxytocin in female monogamy 
(5). Focusing on the male prairie vole brain, 
Insel and Shapiro showed that the distribution
of a receptor for the closely related vasopressin,
V1aR, differed strikingly from
closely related polygynous species,
with a hot spot of expression in the
ventral pallidum, a part of the
basal ganglia known to mediate
reward and motivation (3). Insel,
Young, and their co-workers
demonstrated that increased
V1aR expression in the ventral
pallidum increased partner preference in 
transgenic voles and mice (3). 

Okhovat et al. now return to the field, 
blending genomics with ecological and 
evolutionary analyses made possible by be- 
havioral studies in seminatural enclosures. 
They take advantage of the fact that prairie 
voles differ in their degree of monogamy. 
Some stray from their mate and engage in 
frequent extra-pair fertilization (EPF); oth- 
ers are more faithful, engaging mostly in 
intra-pair fertilization (IPF). Phelps and colleagues
(6) showed earlier that EPF males
have lower levels of V1aR expression than
IPF males, but with a twist on the original
V1aR findings: The lower levels
of V1aR expression are not seen in
the ventral pallidum but rather 
in parts of the brain involved in 
spatial learning, including the 
retrosplenial cortex. 

Phelps and colleagues (4)
confirm these findings and conclude
that they are likely due to differences
in the regulation of avpr1a, the gene
encoding V1aR. They base this conclusion
on DNA sequencing of the avpr1a region
of the prairie vole genome, chromatin immunoprecipitation,
bisulfite sequencing,
and bioinformatic analysis. In the course of
these experiments, the authors discovered
four genetically linked single-nucleotide
polymorphisms (SNPs) in putative enhancer
regions for avpr1a, which puts them in the
right places to influence avpr1a expression.
They also found behaviorally related differences
in methylation in various parts of the
avpr1a gene, with enhancer methylation
correlated with retrosplenial cortex avpr1a
expression.

Consistent with best practices in behav- 
ioral genetics, the authors replicated the be- 
havior-SNP associations with animals from 
a different population. Nonetheless, it will 
be important to further examine the causal 



Degrees of faithfulness. Okhovat et al. report behavioral, neurobiological, genomic, and evolutionary analyses 
that explain individual differences in male prairie vole monogamy. They attribute the finding that less faithful (EPF) 
males range more widely than more faithful (IPF) males to poorer spatial memory skills, which are related to striking 
differences in the regulation of avpr1a and the expression of V1aR in the brain.



relationship between genetic and behavioral 
variation. Surely CRISPR/Cas9 prairie voles 
will not be long in coming. 

Okhovat et al. also use radio-tracking data 
to show that EPF males have relatively larger 
home ranges, which overlap with the terri- 
tories of other males. Are EPF males more 
amorous, gregarious, or novelty-seeking? We 
do not know yet, but the authors use their 
neuroanatomical data to argue for a more 
prosaic explanation: poor spatial memory. 



[Figure labels: Variation favoring low V1aR; brain; EPF males roam into neighboring territories and mate with additional females; IPF males stay closer to home and are more faithful.]

The idea builds on an earlier suggestion
(7) that IPF males might remember painfully
well the locations of rough encounters with
other males and thus stay closer to home,
whereas EPF males are less inhibited by unpleasant
spatial memories and continue to
roam (see the figure). This speculation provided
the basis for Okhovat et al.’s hypothesis
that the well-known behavioral ecological
trade-off between siring additional offspring
and being cuckolded while out and about operates
for prairie voles. They report a positive
correlation between how often a male in- 
trudes on a neighboring male’s territory and 
how often his own territory is intruded upon 
by another wandering male. An EPF male en- 
counters more females than IPF males and 
thus more opportunities for mating, but so 
does his partner back home. This trade-off 
is etched in the genome, with evidence of 
balancing selection for the above-mentioned 
avpr1a SNPs, but no such evidence at several
other locations in the prairie vole genome. 

Okhovat et al. propose that high population
densities favor genetic variants resulting
in lower V1aR expression, poorer spatial
memory, and more expansive home ranges 
to capitalize on enhanced possibilities of 
extra-pair matings. Low population densi- 
ties would favor the inverse of these traits. 
In other words, the evolutionary explana- 
tion for the persistence of both EPF and 
IPF males points to the very same cycles of 
population density that originally motivated 
Getz’s field studies. 

The study by Okhovat et al. impressively 
bridges mechanistic and evolutionary anal- 
yses to provide a detailed picture of indi- 
vidual differences in social behavior. Future 
studies should try to integrate the spatial 
learning and partner preference narratives 
for both males and females; the joint evolu- 
tionary dynamics of male and female traits 
must be considered to fully understand a 
mating system (8). With the availability of 
the prairie vole genome, future analyses 
also will no doubt include efforts to identify
other genes that interact with avpr1a,
in both mechanistic and evolutionary contexts
(9). Measuring the effects of changes
in population density on gene expression 
throughout the brain will help us better 
understand how nature and nurture shape 
social life (10). M. ochrogaster has come a 
long way from the traps on the prairie and 
clearly has much more to teach us. ■ 
POLICY

By Jorge L. Contreras 1* and
Jerome H. Reichman 2

Ambitious international data-sharing
initiatives have existed for years in
fields such as genomics, earth science,
and astronomy. But to realize
the promise of widespread sharing of
scientific data, intellectual property,
data privacy, national security, and other
legal and policy obstacles must be overcome
(1). Although these issues have attracted
much attention in some circles, they
have often taken a back seat
to addressing technical challenges.
Yet failure to account
for legal and policy issues at the outset of 
a large transborder data-sharing project 
can lead to undue resource expenditures 
and data-sharing structures that may of- 
fer fewer benefits than hoped. Drawing on 
our experience with the Belmont Forum, 
a multinational earth change-research program,
we propose a framework to help plan
data-sharing arrangements with a focus on 
early-stage decisions including options for 
legal interoperability. 

A rich literature beginning with the work 
of Ostrom (2) addresses the organization 
and governance of common pool resources 
shared by communities of users in contexts 
ranging from the global environment to 
communal living spaces. More recent work 
has expanded these principles to knowl- 
edge commons: collections of intangible 
resources, such as digital libraries, scholarly 
publications, and scientific data (3). Re- 
sponding to calls for increased international 
scientific collaboration, several expert bod-
ies have developed high-level principles for 
transborder data sharing (4-6). Although 
these efforts lay the groundwork for broad 
data-pooling initiatives, critical design deci- 
sions must be made before larger issues of 
governance and operation. 

A SPECTRUM OF CENTRALIZATION. Al- 
though little empirical research exists on 
commons structures for data sharing and 
related costs, we have observed four basic 
structural models for scientific data pools 
along a continuum ranging from the most to 
the least centralized (see the table). 

(i) fully centralized: all data are aggregated
in a single, centrally managed repository;

(ii) intermediate distributed: repositories
are distributed and separately maintained,
but may be interconnected by a central access
portal, share technical service components,
and utilize a common data-exchange
format [sometimes called a federated database
system (7)];

(iii) fully distributed: repositories are
maintained locally and are not technically in- 
tegrated, but share a common legal and policy 
framework that allows access on uniform 
terms and conditions (legal interoperability); 

(iv) noncommons: repositories are largely 
disaggregated and lack technical and legal 
interoperability and, at most, may share a 
common index. 

Centralized repositories with curation, an- 
alytics, and quality control can enhance the 
value of the data they contain [e.g., the Gen- 
Bank repository of DNA and RNA sequence 
data (8)]. Centralized structures, however, 
come at a cost and may be impractical in 
many transborder collaborations because of 
political, legal, and organizational issues. But 
the alternative to a fully centralized commons 
need not be a noncommons. The shortfalls 
of noncommons models include incompat- 
ible data formats, inability to search across 
data sets, underutilization of data resources, 
individualized and inefficient access require- 
ments, and difficulties moving data across 
national boundaries. Distributed commons 
structures, however, offer a meaningful sub- 
set of benefits with lower cost and resource 
commitments than fully centralized models. 

For example, an online portal through 
which researchers can access multiple independent
repositories may feel like a centralized
commons to users, but avoids the cost
and governance overhead of a centralized 
repository [e.g., the Global Earth Observation 
System of Systems (GEOSS)]. Portal-based 
structures may make it easier for a central ad- 
ministrator to provide users with value-added 
services and aggregated statistics [e.g., the 
World Data Center for Microorganisms (9)],
and allow users to more easily query, com- 
bine, and analyze multiple data sources (7). 

Even if resources do not exist to link reposi- 
tories technically, there are advantages to 
fostering legal interoperability among dis- 
tributed repositories (10). To achieve this 
across jurisdictions, rules for data access and 
usage must be compatible with each other, 
must comply with laws and regulations of 
relevant jurisdictions, and must address 
rights of ownership and control granted to 
data generators (11). Legal interoperability 
can enable researchers to access and use data 
across multiple repositories without seeking 
authorization on a case-by-case basis, which 
increases the likelihood that more data will 
be put to productive use. 

Perhaps the most straightforward path to 
legal interoperability is simply to contribute 
data to the public domain and waive all fu- 
ture rights to control it (11). This approach 
has been advocated by more than 250 organizations
that have endorsed the 2010 Panton
Principles for open data in science (12).
Alternatively, researchers who wish to receive 
attribution credit for their contributions, but 
are otherwise willing to relinquish control 
over them, have released data under stan- 
dardized Creative Commons licenses that 
have been widely used for other online content,
including open-source software,
music, and photographs. 

Despite the simplicity and appeal of these 
approaches, they are not always feasible. 
Data will often remain subject to legal regulation
if, for instance, they explicitly or implicitly
reveal personally identifiable information,
were obtained from human research subjects,
relate to sensitive technologies, or disclose
infrastructural details. Wilbanks and
others, recognizing these requirements, have 
called for new models of informed consent 
and privacy protection to facilitate broad, so- 
cially beneficial sharing of at least some cat- 
egories of such data (13). 

DESIGN CONSIDERATIONS. If a collabora- 
tive research project has sufficient resources 
to create a centralized data repository with 
accompanying infrastructure and staffing 
(potentially millions of dollars up-front and 
thereafter for fully staffed and curated reposi- 
tories), important benefits can be achieved. 
In most cases, however, this level of funding 
will not be available and a distributed data 
commons could be a desirable alternative. 



We found, in our experience with the Bel- 
mont Forum, that the project’s leadership 
gave substantial weight to early aspirational 
statements regarding broad data sharing. Suf- 
ficient consideration may not have been given 
to potentially useful distributed data struc- 
tures. When, at the conclusion of a lengthy 
planning stage, it became apparent that a cen- 
tralized commons was beyond budgetary con- 
straints, the decision was made to settle for 
no commons at all and rely on lofty but non- 
specific data-sharing principles to motivate 
researchers to share data on their own (14). 
To help avoid such dilemmas in the future, we 
offer the following actionable framework for 
evaluating distributed data commons early in 
the project-planning phase: 

How many data repositories are under
consideration? If the number is small, then
fully distributed, unlinked repositories (i.e., 
no commons) may suffice. Researchers may 
easily access each repository, and the cost of 
a commons structure can be avoided. 

Are there resources to develop a common
data portal? As the number of repositories
increases, some form of commons structure 
will likely facilitate data sharing and usage. 
Although the cost is not trivial, a common 
portal can enhance the value and usabil- 
ity of the data. If funding for a data por- 
tal is not available, planners may wish to 
consider a fully distributed commons with 
legal interoperability. 



Structural models for scientific data pools
Data-sharing options: benefits and costs

INCREMENTAL RESEARCH BENEFITS

Data access
    Centralized: Access to all data in unified manner
    Intermediate distributed: Access to multiple repositories through central portal
    Fully distributed: Access to each repository separately, but under a common usage/access policy and single approval
    Noncommons: Ad hoc coordination with other repositories only

Data analytics
    Centralized: Most powerful search, analysis, quality assurance of aggregated data
    Intermediate distributed: Cross-repository searching and analytics; metadata and aggregate statistics can be developed by central authority
    Fully distributed: Index/catalog only
    Noncommons: Index/catalog only

COSTS

Up-front costs
    Centralized: Structure and build centralized repository; develop data interoperability mechanisms; develop common usage policy
    Intermediate distributed: Develop data interoperability mechanisms; develop common usage policy
    Fully distributed: Develop common usage policy
    Noncommons: Few up-front costs

Ongoing centralized costs
    Centralized: Operating and maintaining central repository; administering policies
    Intermediate distributed: Operating and maintaining portal; administering policies
    Fully distributed: Administering policies
    Noncommons: No central costs

Ongoing distributed costs
    Centralized: Few distributed costs
    Intermediate distributed: Operating and maintaining repositories
    Fully distributed: Operating and maintaining repositories
    Noncommons: Operating and maintaining repositories

Governance overhead
    Centralized: Central repository
    Intermediate distributed: Central portal/services, each distributed repository, and interrelationships
    Fully distributed: Each distributed repository and interrelationships
    Noncommons: Each distributed repository with minimal coordination
SIGNAL PROCESSING

Matched filtering of 
ultrashort pulses 

Optical signal processing can help reduce noise 
in detection of ultrafast electrical pulses 
Are data regulated in the relevant jurisdictions?
This question is relevant no matter
which commons structure is selected.
If data are not regulated or subject to 
human-subject, privacy, health, or similar 
legal regimes, consider releasing data to the 
public domain or licensing under a common- 
use license. If data are regulated in one or 
more relevant jurisdictions, planners should 
consider engaging legal experts to develop a 
common data access and use policy that com- 
plies with regulations in each jurisdiction. 
For example, if data include human genetic 
information, both genetic nondiscrimination 
laws and data privacy regulations should be 
considered. Legal interoperability, and the 
ability for users to access and use all data on 
consistent terms via a single authorization, 
will be achieved only if the most stringent ju- 
risdiction’s regulations are observed in each 
case or are otherwise addressed (13). 
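
The three planning questions, together with the earlier point that a centralized repository is worthwhile when funding allows, can be read as a simple decision procedure. The sketch below is one possible encoding, not a tool proposed by the authors: the function name, its parameters, and especially the cutoff for a "small" number of repositories are assumptions made for illustration.

```python
def recommend_structure(num_repositories, can_fund_central_repository,
                        can_fund_portal, data_regulated, small_threshold=3):
    """Return (structure, licensing advice) following the planning questions.

    small_threshold is a hypothetical cutoff; the article only says "small".
    """
    if can_fund_central_repository:
        structure = "fully centralized"
    elif num_repositories <= small_threshold:
        structure = "noncommons (unlinked repositories)"
    elif can_fund_portal:
        structure = "intermediate distributed (common portal)"
    else:
        structure = "fully distributed (legal interoperability)"

    if data_regulated:
        licensing = ("engage legal experts; common access policy meeting the "
                     "most stringent jurisdiction's rules")
    else:
        licensing = "public domain dedication or a common-use license"
    return structure, licensing


# Example: many repositories, no portal funding, unregulated data.
print(recommend_structure(12, False, False, False))
```

The point of writing it down this way is that the regulatory question is answered separately from the structural one, mirroring the observation that it applies whichever commons structure is chosen.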

Although the Belmont Forum will doubt- 
less produce a wealth of valuable earth sci- 
ence data, initial appreciation of data-sharing 
options might have facilitated decision-mak- 
ing and planning among its many national 
participants and might have resulted in a 
more robust data-sharing structure. Ad- 
dressing these design choices early— while 
acknowledging budgetary, legal, and political 
constraints— can save planning and imple- 
mentation costs later. 
By Michael Vasilyev

The need to detect a small signal obscured
by a large amount of noise is
like the problem Prince Charming 
faced in his search for Cinderella. 
Fortunately, he had the magic glass 
slipper that perfectly fit only her. In 
signal detection and estimation theory, such 
a magic glass slipper that perfectly fits only 
the signal of interest is called a “matched 
filter.” Because every change in the signal 
shape requires a change of the matched fil- 
ter, it would be highly advantageous for the 
filter to be dynamically reconfigured accord- 
ing to the expected signal. For ultrashort 
(<100 ps) electrical signals that are near the 
limits of today’s electronics, such reconfigu- 
rable matched filters become extremely chal- 
lenging to make. On page 1343 of this issue, 
Ataie et al. (1) show how optical signal pro- 
cessing could help to achieve this goal. They 
demonstrate the detection of a single 80-ps 
pulse in the presence of a large amount of 
noise by converting electrical signals to opti- 
cal signals and subsequently approximating 
the matched filter with a potentially recon- 
figurable optical scheme. 

Department of Electrical Engineering, University of Texas,
Arlington, TX 76019, USA. E-mail: vasilyev@uta.edu

The matched-filter concept arises in the context of optimal detection
of a signal x(t) (see the figure, panels A and B) that is possibly
scaled in magnitude, shifted in time, and degraded by additive noise
(panel C). The optimal receiver correlates its input with the matched
filter for a time shift τ [x(t − τ)*], producing the correlation
function (panel D) with a tall and narrow peak at τ = 0, thereby
permitting easy detection. In the frequency domain, this is equivalent
to multiplying the received spectrum by the matched filter in the shape
of the complex conjugate of the expected signal spectrum. For received
signals shifted in time, the matched time-domain filter is shifted
accordingly, or the frequency-domain filter is multiplied by a linear
phase response. It is well known (2) that for a signal degraded by
white noise, the matched filter maximizes the signal-to-noise ratio
(SNR) in the optimal receiver. Optical receivers with matched filtering
have been demonstrated to operate within a fraction of a decibel from
the quantum sensitivity limit (3) and are being considered for use in
future near-Earth and interplanetary laser communication links.
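The frequency-domain recipe described above can be tried numerically. The following is a minimal sketch, not the scheme of Ataie et al.: it assumes a hypothetical chirped test pulse, white Gaussian noise, and circular correlation computed with NumPy's FFT. The pulse shape, lengths, noise level, and all variable names are illustrative assumptions.

```python
import numpy as np

def matched_filter_correlation(received, template):
    """Circular cross-correlation of `received` with `template` via the FFT."""
    n = len(received)
    received_spectrum = np.fft.fft(received)
    template_spectrum = np.fft.fft(template, n)  # zero-pads the template to length n
    # Multiply the received spectrum by the complex conjugate of the
    # expected signal spectrum, then return to the time domain.
    return np.fft.ifft(received_spectrum * np.conj(template_spectrum)).real

# Hypothetical test signal: a chirped (dispersed) pulse of unit amplitude.
pulse_len = 512
t = np.arange(pulse_len)
template = np.cos(np.pi * 0.25 * t**2 / pulse_len)  # sweeps 0 -> 0.25 cycles/sample

rng = np.random.default_rng(0)
n, true_shift = 4096, 1500
received = rng.normal(0.0, 1.0, n)                  # white Gaussian noise
received[true_shift:true_shift + pulse_len] += template  # pulse buried in the noise

correlation = matched_filter_correlation(received, template)
estimated_shift = int(np.argmax(correlation))       # lag of the correlation peak
```

In this sketch the pulse is not obvious in the raw samples, yet the correlation peak should fall at or very near `true_shift`, because the pulse's samples add coherently while the noise does not.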

Physically, the use of the matched fil- 
ter integrates all of the signal’s temporal 
or spectral components in such a way that 
they add up coherently (they are in phase) to 
create the peak of the correlation function, 
whereas the noise’s components add inco- 
herently (they have random phases). Thus, 
the resulting SNR is improved relative to 
the SNR of the individual components. The 
matched filter not only recovers the arrival 
time of the signal, but also compensates for 
the spectral phase distortions caused by sig- 
nal propagation (dispersion) in microwave 
transmission lines and optical fibers, which, if left uncompensated,
lead to unwanted pulse broadening (see the figure, panel B) as well as
chirping and ringing effects in the time domain that further drown the
signal in the noise (panel C).
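The coherent-versus-incoherent argument can be made quantitative with a simple counting sketch. Assume, purely for illustration, N components of equal signal amplitude a, each carrying uncorrelated noise of variance σ² (these symbols and the equal-amplitude assumption are not part of the original text):

```latex
\begin{align*}
  P_{\mathrm{signal}} &= \Bigl(\sum_{k=1}^{N} a\Bigr)^{2} = N^{2} a^{2}
      && \text{(amplitudes add in phase)} \\
  P_{\mathrm{noise}}  &= \sum_{k=1}^{N} \sigma^{2} = N \sigma^{2}
      && \text{(variances add for random phases)} \\
  \mathrm{SNR}_{\mathrm{sum}} &= \frac{N^{2} a^{2}}{N \sigma^{2}}
      = N \cdot \frac{a^{2}}{\sigma^{2}} = N \cdot \mathrm{SNR}_{\mathrm{single}}
\end{align*}
```

Under these assumptions the summed SNR exceeds that of a single component by a factor of N.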

The signal domain can be straightforwardly expanded to two-dimensional
images. Here, the linear phase response in the spatial frequency
domain corresponds
to image translation, whereas the para- 
bolic phase response represents distortions 
caused by free-space propagation (diffrac- 
tion in paraxial approximation). A simple 
example of a filter with matched phase 
response that is easily reconfigurable for 
various signals is the lens of the human 
eye. It compensates for the parabolic phase 
shift caused by the free-space image propa- 
gation to ensure that different spatial fre- 
quency components of the image arrive in 
phase (in focus) at each pixel of the retina. 
The eye’s lens easily accommodates (recon- 
figures) for focusing on object planes at 
various distances, and, if such a distance is 
beyond the eye’s accommodation range, the 
matched phase filtering can be provided by 
eyeglasses or a loupe (see the photo). 

Another example is a two-dimensional 
amplitude filter that dramatically facilitates 
the detection of a word message, as shown 
in panel E of the figure. This filter is only 
partially matched because it accepts differ- 
ent letters within each transmission window, 
and hence can receive different messages. A 
filter that is 100% matched to a message will 
have the amplitude windows in the shapes 
of the specific letters of the message, which 
maximizes the correlation and SNR when 
the filter is aligned with the message. 

One of the applications most sensitive to 
the accuracy of the matched-filter shape is 
the homodyne detection of a quantum state 
of a particular temporal or spatial mode (4). 
Here, the mode of the homodyne local oscillator, when mixed with the
signal, serves as
the matched filter, and even small errors in 
its shape could couple considerable extra 
noise to the photocurrent. This noise could 
potentially destroy the detection of quantum 
states manifesting themselves in fragile pho- 
tocurrent features with amplitudes below 
the shot-noise level. Recent extensive efforts 
have identified the matched-filter shapes for 
temporally multimode (5-7) and spatially 
multimode (8-11) quantum states produced 
by various nonlinear processes. 

The scheme implemented by Ataie et al. nearly noiselessly replicates
an ultrashort electrical signal in the form of multiple
frequency-shifted optical clones. The am- 
plitudes and phases of all frequency com- 
ponents of the signal can be measured 
simultaneously and independently by rela- 
tively slow, narrowband optical homodyne 
detection of the cloned signals. The local 
oscillators are supplied by the individual fre- 
quency components of a frequency comb. Af- 
ter compensating the linear phase response 
representing temporal shift, the amplitudes 
of the signal’s frequency components are 
coherently added together, while noises add 
incoherently, which improves the SNR. This 
addition is equivalent to using a partially 
matched filter with the optimum phase re- 
sponse but with a flat amplitude response. 

The approach of Ataie et al. builds on 
recent remarkable advances in optical 
combs and coherent signal processing 
technologies, and it offers flexibility that 
can potentially be used to adaptively synthesize matched filters for
extremely short electrical pulses of arbitrary shapes. For example,
the demonstrated scheme could be extended to make a 100% matched
filter by properly weighting the added amplitudes.
Moreover, because the addition is done by 
a digital signal processor, it can be easily re- 
configured to match amplitude and phase 
profiles of a variety of signals. In particular, 
it could compensate for a temporal signal 
shape distortion caused by group-velocity 
dispersion by performing parabolic rather 
than linear approximation of the phases of 
a signal’s frequency components. Such an 
adaptive filter synthesis for processing ul- 
trashort signals is very desirable in many 
applications, such as determining neutrino 
velocities by accurately detecting their ar- 
rival times (12) or hiding and uncovering in 
the noise a secure signal with a scrambled 
phase response [steganography (13)]. ■ 
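The difference between a flat-amplitude (partially matched) sum and a fully matched, amplitude-weighted sum can be illustrated with a short calculation. This is a sketch under stated assumptions, not the processing used by Ataie et al.: phases are taken as already aligned, the noise in each frequency component is uncorrelated with equal variance, and the Gaussian spectral envelope and all names are hypothetical.

```python
import numpy as np

def combined_snr(amplitudes, weights, noise_var=1.0):
    """SNR of y = sum_k w_k * X_k, where X_k = a_k + noise_k.

    Assumes phases are already aligned and the noise terms are
    uncorrelated with equal variance `noise_var`.
    """
    a = np.asarray(amplitudes, dtype=float)
    w = np.asarray(weights, dtype=float)
    signal_power = np.dot(w, a) ** 2          # weighted amplitudes add coherently
    noise_power = noise_var * np.dot(w, w)    # weighted noise variances add
    return signal_power / noise_power

k = np.arange(-10, 11)                        # 21 hypothetical comb lines
amplitudes = np.exp(-(k / 5.0) ** 2)          # hypothetical spectral envelope

snr_single = combined_snr(amplitudes, (k == 0).astype(float))  # strongest line only
snr_flat = combined_snr(amplitudes, np.ones_like(amplitudes))  # flat amplitude response
snr_matched = combined_snr(amplitudes, amplitudes)             # weights match the signal
```

By the Cauchy-Schwarz inequality the amplitude-matched weights can never give a lower SNR than the flat weights, and the two coincide only when all amplitudes are equal.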
Matched filtering in the temporal and spatial domains. (A) A transform-limited (as short as possible for its bandwidth) pulse (red) and (B) the same pulse broadened by dispersion (colors represent phases). (C) The measured real part of the latter pulse’s electric field Re(E) can be easily lost in the noise. However, correlation (D) of such a noisy dispersed pulse with a time-shifted matched filter x(t − τ)* creates a readily detected peak at τ = 0. (E) A message is hidden under clutter and noise but can be easily read by using a (partially) matched amplitude filter.
Opening the gate on superconductivity

Superconducting two-dimensional materials are found to be remarkably robust 



By Hermann J. Suderow 

The deleterious effect of the magnetic field on superconductivity has
plagued researchers and engineers 
alike for more than a century (1). 
Quantized vortices appear in most 
superconductors when applying a 
magnetic field and move under a current, 
creating dissipation that destroys the su- 
perconducting state. By pinning their 
motion through complex materials engi- 
neering, powerful magnets can be built for 
use in resonance imaging for medicine or to 
levitate trains. However, the magnetic field 
also perturbs superconductivity through 
the electron’s magnetic property (known as 
spin). Superconductivity requires electrons 
to form Cooper pairs, in which the spins 
point oppositely to form a singlet state. 
Lu et al. (2) on page 1353 of this issue and 
another recent study (3) show that two- 
dimensional (2D) superconductors such as 
MoS₂ can circumvent the negative effects of the magnetic field on
Cooper pairs’ spin. To induce superconductivity in MoS₂, a transistor
with two gates was used (a solid-state gate and a liquid gate), where
moving ions generate a highly inhomogeneous electric field on top of
thin MoS₂ flakes. The resulting superconductor is extremely thin, just
one MoS₂ layer thick, and is remarkably robust to parallel magnetic
fields. Although
there are other thin superconductors robust 
to parallel magnetic fields, the mechanism 
providing insensitivity to the magnetic field 
is a new advance. 

Monolayers of the material MoS₂ are of
broad interest. For example, they are direct 
band-gap semiconductors, whose gap mag- 
nitude lies in the visible optical range (4). 
Conduction and valence bands are located 
at the corners of the Brillouin zone in two 
inequivalent sites K and K' (red and blue 
semicircles) (see the figure, panel B). This 
band structure is very similar to that of 
graphene, but there is an important difference: the heavy Mo atoms
provide strong spin-orbit coupling (5). This, together with the
absence of in-plane inversion symmetry in single layers of MoS₂ (see
the figure, panel A), provides strong coupling between
the spins of electrons lying at K and K' sites 
(5). The splitting of the bands for each spin 
polarization is inverted at the two sites. Be- 
cause of the splitting, the spins of electrons 
having the same energy and opposite mo- 
menta are oriented in opposite directions. 
This behavior mimics the spin and mo- 
mentum structure of a singlet Cooper pair 
(two electrons with opposite momenta and 
spin). By gating the split bands of the semi- 
conductor to the Fermi level, Cooper pairs 
form, and the relative orientation of their 
spin and momenta is fixed by the coupling 
between electrons among K and K' sites en- 
forced by the band structure. Because the 
splitting between the bands of opposite 
spin polarity is so large, the spins of the 
Cooper pairs remain effectively protected 
from external magnetic fields. 

The effects of the magnetic field on the 
Cooper pairs’ spin (7) have remained notoriously difficult to control
experimentally.
It is thus of little surprise that many sci- 
entists active in superconductivity tend to 
think of vortices when discussing a super- 
conductor in a magnetic field. This can lead 
to strange situations when meeting experts 
in magnetism, who often have in mind that 
magnetic fields act mainly on electrons’ 
spin. I encountered such an expert when 
I was trying to convince a selection panel 
during a job interview a few years ago 
about the importance of vortices. The ex- 
pert stopped me and said that there was no 
point in discussing vortices to explain the 
effect on superconductivity in the magnetic 
field. He argued that the energy of the elec- 
trons shifted depending on the orientation 
of their spin relative to the magnetic field. 
Thus, one could only maintain a singlet 
Cooper pair if the energy shift is smaller 
than the superconducting gap. Fortunately, 
I was able to convince the panel that vortices are important, and that
was enough to get the job. However, the insights presented by the
experiments of Lu et al. show that the expert was partly right.

Admittedly, there have been many demonstrations of the connection
between magnetism and superconductivity, in particular when
superconductors containing a magnetic element were found: for
instance, in the rare-earth nickel borocarbides (8), in ferromagnetic
superconductors (9), or in the recent Fe-based materials (10). In
these cases, time-reversal symmetry is broken in some form,
particularly in the ferromagnetic superconductors, where triplet
Cooper pairs are required (with spins of the same orientation).
However, the spin locking in MoS₂ preserves time-reversal symmetry.
The same occurs in systems with so-called Rashba spin-orbit coupling.
Such coupling arises in the absence of inversion symmetry and in
materials with heavy elements and again locks electrons’ spin and
momenta. It is related to the electric-field gradient leading to
spin-orbit coupling. The “fictitious” magnetic field generated by
Rashba spin-orbit coupling is oriented in the plane of the Cooper pair
electrons’ momentum. This can eventually favor mixtures of singlet and
triplet Cooper pairs (11). Estimates for the size of this effect,
however, provide very small values for MoS₂ (2, 3). By contrast, the
effect discussed in MoS₂ is a direct consequence of the inequivalence
of K and K' sites on the Fermi surface.

Monolayers of the related material NbSe₂, another transition-metal
dichalcogenide, show similar behavior (12). Yet another recent work
discovers amazing properties of 2D vortices in ionic gate-induced
superconductivity in ZrNCl (13). It seems that a gate is being opened,
likely showing beautiful vistas of superconductivity through a looking
glass, where the crystal lattice is trump. ■

Ready to switch on. (A) Thin semiconducting MoS₂ is gated from below with a solid-state gate and from above with a liquid gate. By controlling the gate voltages, a 2D superconductor is formed in which Mo (dark green) and S (light green) atoms comprise a structure with no in-plane inversion symmetry. (B) The Cooper pair electrons are represented by colored spheres and their spin by perpendicular arrows. Cooper pairs form with opposite momenta (k and −k). The black hexagon represents the first Brillouin zone. The Fermi surface consists of pockets at the edges (blue and red semicircles). Spin-orbit coupling and lack of inversion symmetry conspire to fix the spin of each electron to its momentum’s direction.

CANCER

Revisiting vitamin C and cancer

A high dose of vitamin C kills certain colon cancer cells

By Colleen R. Reczek and Navdeep S. Chandel

In the early 1970s, the two-time Nobel
Prize-winning chemist Linus Pauling 
proposed that high doses of vitamin 
C (ascorbic acid) can act as an antioxi- 
dant to reduce cancer. Pauling and his 
colleague Ewan Cameron reported that 
cancer patients given intravenous vitamin C 
(10 g/day) followed by oral delivery had an 
increased rate of survival (1). This led to two 
large clinical trials carried out by the Mayo 
Clinic in the late 1970s and mid-1980s (2, 3), 
which demonstrated that oral administra- 
tion of a high dose of vitamin C had no ef- 
ficacy as a cancer therapeutic. Furthermore, 
dietary antioxidants have failed as antican- 
cer agents in clinical trials (4). However, on 
page 1391 in this issue, Yun et al. (5) show
that high doses of vitamin C selectively kill
colorectal cancer cells carrying activating 
mutations in the oncogenes KRAS or BRAF, 
which are often refractory to approved tar- 
geted therapies. 

Vitamin C is taken up by cells through 
sodium-dependent vitamin C transporters, 
whereas the oxidized form of vitamin C, de- 
hydroascorbate (DHA), moves into cells via 
glucose transporters such as GLUT1 (5, 6). 
Once inside the cell, DHA is reduced back 
to vitamin C by glutathione (GSH), which 
consequently becomes oxidized glutathione 
(GSSG). Subsequently, GSSG is converted 
back to GSH by reduced nicotinamide ad- 
enine dinucleotide phosphate (NADPH) (see 
the figure). High doses of vitamin C can in- 
crease the amount of reactive oxygen species 
(ROS) in cancer cells and exert antitumori- 
genic activity (7). However, the molecular 
mechanisms by which vitamin C inhibits tumorigenesis remained unknown.

Yun et al. observed that vitamin C was oxi- 
dized to DHA in cell culture media lacking 
reducing agents, and was subsequently im- 
ported into human colon cancer cells harbor- 
ing oncogenic KRAS or BRAF mutations by 
GLUT1. Moreover, the authors showed that 
high doses of vitamin C that resulted in a 
peak plasma concentration of 30 mM when 
administered intraperitoneally reduced the 
intestinal tumor burden in mice bearing conditional oncogenic
activating forms of Kras and adenomatous polyposis coli (Apc)
mutations, but not in mice with the conditional Apc mutation alone.
Tumors from the double conditional Kras and Apc mutant mice expressed
more GLUT1 than tumors from mice harboring only the Apc mutation.
Furthermore, the pan-antioxidant N-acetylcysteine prevented the
vitamin C-mediated decrease in tumor burden, indicating that elevated
ROS amounts due to vitamin C were responsible for reducing
tumorigenesis in vivo.

Yun et al. report that the increased up- 
take of DHA into the KRAS or BRAF mutant 
cancer cells, which express more GLUT1 
compared to normal cells, leads to the rapid 
conversion of DHA to vitamin C, resulting in 
the depletion of GSH and NADPH and an in- 
crease in ROS. Metabolite analysis revealed 
an increase in glucose carbons into the oxi- 
dative pentose phosphate pathway, a major 
cytosolic mechanism to generate NADPH. 
Indeed, increases in ROS have been shown 
to activate the oxidative pentose phosphate 
pathway within minutes (8). Additionally, 
metabolite analysis revealed an increase 
in glycolytic intermediates such as glycer- 
aldehyde-3-phosphate (G3P) upstream of 
glyceraldehyde 3-phosphate dehydrogenase 
(GAPDH). Concomitantly, Yun et al. observed 
a decrease in metabolites downstream of the 
GAPDH reaction, suggesting GAPDH inhibi- 
tion by ROS as a result of the increased DHA 
flux into KRAS or BRAF mutant colon cancer 
cells. GAPDH is a redox-sensitive protein, as its active-site cysteine
residue can be targeted by ROS. Inhibition of GAPDH decreases the
generation of glycolytic adenosine 5′-triphosphate (ATP) and pyruvate,
a major substrate required to drive ATP production in the
mitochondria. However, pyruvate supplementation can rescue the cell
death and energetic crisis caused by GAPDH inhibition. Thus, these
results suggest that high doses of vitamin C impair glycolysis and
could be combined with the antidiabetic drug metformin, which can also
diminish tumor burden by inhibiting mitochondrial complex I (9).

Vitamin C and cell metabolism. The uptake of vitamin C by KRAS and BRAF mutant colon cancer cells is shown. The resulting increase in ROS production blocks glucose metabolism (in addition to other effects shown). Cells shift their glycolytic flux into the pentose phosphate pathway (PPP), but ultimately, the cells become depleted of ATP, inducing an energetic crisis that leads to cell death.

In addition to oxidizing GAPDH, the 
elevated ROS amounts induced by vita- 
min C cause DNA damage, resulting in 
poly(ADP-ribose) polymerase (PARP) activation and NAD⁺ consumption.
Notably, the GAPDH enzymatic reaction utilizes NAD⁺ to convert G3P to
1,3-bisphosphoglycerate (1,3BPG). Therefore, the decrease in NAD⁺ due
to PARP activation further diminishes the GAPDH reaction. Inhibition
of PARP or administration of nicotinamide mononucleotide, a precursor
of NAD⁺ synthesis, partially rescued cell viability after vitamin C
treatment in vitro. Collectively, these findings suggest that in KRAS
and BRAF mutant cells, vitamin C-induced endogenous ROS inhibits the
GAPDH reaction directly (oxidizing GAPDH) as well as indirectly
(reducing the NAD⁺ pool), leading to an energetic crisis
that triggers cell death. 

High GLUT1 expression alone, however, 
does not make a cell more susceptible to 
vitamin C cytotoxicity. Wild-type KRAS and 
BRAF colon cancer cells overexpressing GLUT1 were resistant to vitamin
C-induced cell death, implying that oncogenic KRAS- or BRAF-induced
metabolic reprogramming, in addition to high GLUT1 expression, is
needed for toxicity. One metabolic liability of oncogenic KRAS-driven
tumors is their increased rate of mitochondrial and cytosolic NADPH
oxidase-generated ROS compared to wild-type cells, which initiate
localized signaling pathways necessary for tumor cell proliferation
and tumorigenesis (10). Consequently, these cancer cells increase
their antioxidant defense system by up-regulating the expression of
the transcription factor nuclear factor (erythroid-derived 2) related
factor-2 (NRF2) to buffer the accumulation of ROS and prevent damage
(4). Impairing NRF2 or disabling antioxidant proteins in oncogenic
KRAS-driven cancer cells would allow excessive amounts of ROS to
accumulate and incur cell death, resulting in reduced tumor growth
(11, 12). The results by Yun et al. are consistent with previous
findings that increasing endogenous ROS with high doses of vitamin C
reduces the tumor growth of oncogenic KRAS-driven pancreatic cancer
cells (13). Thus, tumors that exhibit a high rate of ROS generation
coupled with increased GLUT1 expression are likely to benefit from
treatment with a high dose of vitamin C.

An important difference between the 
study by Yun et al. and many previous stud- 
ies is the mode of vitamin C delivery. Oral 
ingestion of high concentrations of vitamin 
C, 100 times the recommended dietary allowance, rarely yields a plasma
concentration greater than 200 μM because of limited absorption and
renal excretion. By contrast, vitamin C administered intravenously can
reach a plasma concentration of up to 10 mM and is safe in humans. Yun
et al. injected mice intraperitoneally with vitamin C, which reached
millimolar concentrations in the plasma.

A recent pilot phase IIa clinical trial using
intravenous administration of vitamin C in 
conjunction with conventional paclitaxel- 
carboplatin therapy demonstrated a benefit 
in a small number of patients (14). Further- 
more, metastatic tumor cells can survive the 
hostile oxidizing environment of the blood 
by increasing their antioxidant defenses (15). 
Thus, infusion of vitamin C may be an effec- 
tive therapeutic strategy to induce the cell 
death of circulating metastatic tumor cells. 

One drawback of intravenous administra- 
tion of vitamin C is that patients will have to 
visit the clinic for vitamin C infusions daily 
for months. However, the development of a 
new oral formulation of vitamin C that can 
achieve high plasma concentrations may cir- 
cumvent this concern. Nevertheless, the study 
by Yun et al. provides a mechanistic rationale 
for how vitamin C selectively kills KRAS and 
BRAF mutant colorectal cancer cells. These 
findings warrant high-dose vitamin C clini- 
cal trials with selectivity for patients with a 
high GLUT1 expression combined with KRAS 
or BRAF oncogene-induced metabolic repro- 
gramming. After all these years, it seems that 
Pauling may have been correct on the use of 
high doses of vitamin C for cancer therapy 
but for the wrong reasons— not as an antioxi- 
dant, but as a pro-oxidant anticancer agent. ■ 
STEM CELLS

What is an adult stem cell?

Tissues may use diverse mechanisms to replace lost cells 



By Hans Clevers 

The paradigm of the hematopoietic
stem cell (HSC) has become deeply 
engrained in our minds, ever since 
bone marrow transplantation tech- 
nology narrowed in on this almost 
magical multipotent entity that gives 
rise to all blood cells. The HSC’s ability to 
“self-renew” as well as to proceed down hi- 
erarchical differentiation pathways involves 
a rigidly choreographed flow of events. The 
HSC paradigm currently serves as a tem- 
plate to interpret experimental observa- 
tions on any other mammalian tissue. Yet, 
it is not obvious why evolution would have 
come up with the very same solution for the 
renewal of all tissues. Attempts to fit obser- 
vations on solid tissues into the HSC hier- 
archy mold have led to confusing theories, 
terminologies, experimental approaches, 
and heated debates, many of which remain 
unresolved. Organs differ in size, archi- 
tecture, and function, and are subject to 
markedly different biological and physical 
challenges. It therefore appears plausible 
that tissues, with their different regenera- 
tive demands, have evolved different ways 
to restore cell numbers. 

After the atomic bombing of Hiroshima, 
experiments in irradiated mice established 
that bone marrow transplantation pro- 
tected against the drop in the number of 
white blood cells. Therapeutic bone mar- 
row transplantation technology allowed 
scientists to reduce a qualitative phenom- 
enon— the rescue of blood cell production 
by foreign bone marrow— to the properties 
of a physical entity, the HSC. Development 
of a surrogate, quantitative assay for HSCs 
in vivo (the spleen focus-forming assay) was 
followed by various in vitro assays for clonogenic HSCs and
lineage-restricted progenitors. All of this has led to a consensus on
the defining characteristics of HSCs and the differentiation hierarchy
they fuel.

The HSC is rare and it divides infre- 
quently (it is “quiescent”), as DNA repli- 
cation carries the risk of mutation. HSC 
division occurs asymmetrically, yielding one 
actively dividing daughter progenitor and a 
new, quiescent stem cell. The HSC has the 
unique, defining capacity to self-renew and
is therefore long-lived. Dividing progeni- 
tors migrate down the hierarchy to become 
progressively lineage-restricted, eventually 
yielding the various mature blood cells. 
Thus, cells flow unidirectionally through 
the system, away from the HSC. These insights were obtained decades
before tissue renewal in other mammalian tissues could
be studied. What have we learned since? 

DISTRIBUTED STEM CELL FUNCTION. The development of cell culture technologies
in the 1970s allowed extensive expansion 
of proliferative epidermal cells into large 
sheets. As autologous transplants, these 
sheets have saved the lives of many burn 
patients, a dramatic demonstration of stem 
cell potential. Yet to date, the identity of the 
epidermal stem cell has remained controversial. It gives rise to a
single cell type, the keratinocyte, which disqualifies it as a stem
cell, as it is not multipotent. But behind this
semantic issue, a deeper problem exists. 



"... the search for stem cells as 
a physical entity may need 
to be replaced by the search 
for stem cell function...” 

It has remained impossible to pinpoint a 
rare, quiescent stem cell, hidden between 
the abundance of proliferative progeni- 
tors that constitute the epidermal stratum 
basale. Indeed, random labeling of these 
proliferative cells yields epidermal clones 
of constantly changing shapes and sizes, 
which can persist lifelong (1). 

Similar situations occur in the esophagus (2), intestinal crypts (3),
stomach glands (4), and the testis (5). These tissues harbor large
populations of proliferating stemlike cells 
of equal potency, each intrinsically capable 
of generating a long-lived lineage of cellu- 
lar offspring. These constantly proliferating 
stem cells compete for niche space. Chance 
decides which cell will persist long-term; 
all others will eventually disappear. These 
active stem cells do not fit the defining 
characteristics of the HSC: They are abun- 
dant (not rare), cycling (not quiescent), 
mostly divide symmetrically (not asymmetrically), and their life span
is not predetermined but subject to stochastic events.
Thus, while these tissues harbor an explicit 
tissue-renewal function, this function is dis- 
tributed over a large cell population. At the 
single-cell level, there exists no rare, hard- 
wired stem cell with the prototypic stem 
cell characteristics. 
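The idea that chance alone decides which lineage persists can be illustrated with a toy simulation. This is only a sketch: it assumes a Moran-type neutral competition model in a niche of fixed size, which is an illustrative choice rather than the model used in the cited studies, and the niche size, seed, and function name are hypothetical.

```python
import random

def neutral_drift(n_cells=8, seed=0, max_steps=100_000):
    """Neutral competition among equipotent stem cells in a fixed-size niche.

    Each cell starts its own lineage. At every step a randomly chosen cell
    divides symmetrically and one daughter displaces a randomly chosen
    neighbor. Returns the final niche and the number of steps taken.
    """
    rng = random.Random(seed)
    niche = list(range(n_cells))          # lineage label of each niche position
    steps = 0
    while len(set(niche)) > 1 and steps < max_steps:
        divider = rng.randrange(n_cells)  # cell that divides
        loser = rng.randrange(n_cells)    # cell that is pushed out of the niche
        if divider != loser:
            niche[loser] = niche[divider]
        steps += 1
    return niche, steps

final_niche, steps_taken = neutral_drift(n_cells=8, seed=1)
surviving_lineages = set(final_niche)
```

No cell in this sketch is intrinsically different from any other, yet the niche ends up occupied by the descendants of a single founder, and which founder wins depends only on the random draws.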

QUIESCENCE. This state of nonprolifera- 
tion, experimentally defined by the ability 
to retain chemical DNA labels, is widely 
used as a defining stem cell characteristic. 
Yet, the overwhelming majority of our cells 
are nondividing, rendering quiescence a 
rather nondiscriminatory parameter. Furthermore, it appears somewhat
counterintuitive that cells whose only raison d’être is the
generation of daughter cells would rarely divide.

Good examples of quiescent stem cells do exist. The first stem cell to
be defined in the hair follicle bulge is quiescent (6). Striated
muscle contains the poster child of quiescence, the satellite cell
(7). Healthy muscle fibers are large, long-lived, nonproliferative
syncytia. Closely apposed to these fibers are small, nondividing cells
barely larger than a nucleus. These satellite cells lie dormant for
years, called into action only upon muscle damage. By asymmetric cell
division, the satellite cell renews itself while also creating its
offspring, the contractile muscle fiber.

In the intestinal crypt, the continuously 
proliferating Lgr5 stem cells (see the figure) 
are located at the base, while a “reserve” 
quiescent stem cell resides at position +4 
counted from the crypt base (3). In one
view, these cells occupy the apex of the hi- 
erarchy, giving rise to the proliferative Lgr5 
cells. Recent genetic tracing studies have 
unveiled an alternative mechanism: +4 
cells represent daughters that have exited 
the cell cycle to terminally differentiate 
into secretory cells (8, 9). Acute loss of the
Lgr5 stem cells can coax +4 cells to settle 
at the crypt base and reacquire a multipo- 
tent Lgr5 stem cell phenotype. Individu- 
ally, these quiescent +4 cells are short-lived 
as they will differentiate and die. But as a 
population, these transiting cells serve as a 
reserve stem cell pool by violating the rule 
that the flow through a stem cell hierarchy 
should be unidirectional. 

PLASTICITY. The +4 secretory precursor 
may be said to exhibit plasticity. Similar ob- 
servations have been made in other tissues. 
Thus, luminal secretory cells of the airway 
epithelium revert to a multipotent stem cell 
state when the pool of basal stem cells is lost 
(10). Terminally differentiated, pepsinogen-producing chief cells
located at the stomach gland base revert at an inconspicuously slow
rate to a multipotent stem cell fate, a pro- 
cess that strongly accelerates upon loss of 
the proliferative stem cell pool (11). In these 
examples, “quiescent” cells play a crucial 
role in tissue renewal upon damage. In their 
unperturbed state, however, these cells have 
ceased to divide while differentiating, not 
because they are stem cells. 

COEXISTING STEM CELL TYPES. The adult 
hair follicle is home to a host of stem cell 
types, each in its distinct location ( 6 ). Dur- 
ing hair follicle maintenance, these stem 
cells only produce the specialized cell types 
required at their specific location. But upon 
damage, they reveal a remarkable level of 
plasticity by generating almost any compo- 
nent of the epidermis. Two classic studies on 
mammary stem cells have followed the HSC 
paradigm. Transplantation of single sorted
mammary epithelial cells into their niche,
the fat pad, has identified rare cells that,
on their own, build a complete, functional 
mammary gland, consisting of the two main 
layers: the basal (or myoepithelial) layer 
and the luminal layer ( 12 , 13 ). Although 
studies using genetic marking in vivo have 
confirmed the presence of such multipotent 
stem cells [e.g. ( 14 , 15)], another study has 
demonstrated that the basal and luminal 
lineages are maintained by independent 
stem cells ( 16 ). A similarly confusing situ- 
ation exists in the prostate stem cell field, 
where multipotent stem cells have alterna- 
tively been reported to reside in the basal 
or in the luminal layer, while a recent study 
indicates that basal and luminal lineages are 
maintained over the long term by independent
stem cells (17). None of these observations
agree easily with a classically designed
stem cell hierarchy.

RENEWAL BY DIFFERENTIATED CELLS. 

The liver is the “champion” of mammalian 
tissue regeneration. Upon hemihepatectomy,
hepatocytes in the remaining liver enter the 
cell division cycle without dedifferentia- 
tion. Within weeks, original liver mass and 
function are restored, upon which all cells 
return to their nonproliferative state. Few 
would use the term “self-renewal” for the 
rapid replacement of surgically removed 
hepatocytes by remaining hepatocytes ( 18 ). 
An alternative, more mundane mechanism 
of liver repair becomes active when noxious 
agents (such as toxins or viruses) simultane- 
ously affect the health of all hepatocytes. 
Under these circumstances of global dam- 
age, cells from the bile duct tree revert to 
a bipotent stem cell state (“oval cells”) ca- 
pable of regenerating hepatocytes and bili- 
ary cells ( 18 ). Great efforts have been made 
to identify “professional” stem cells— cells 
with no other function— in the liver, but it 
may well be that the healthy liver does not 
contain such stem cells and that it relies 
entirely on the proliferative capacity of the 
two differentiated cell types. Similar con- 
siderations may apply to other internal or- 
gans that exhibit little proliferative activity 
in the healthy state, such as the pancreas 
or the kidney. 

HOW TO DEFINE STEM CELL FUNCTION?
Central to the HSC paradigm is
the concept that the maintenance and
restoration of blood cell numbers are
exclusively executed (and can be exclusively
transferred) by a discrete, rare, physical 
entity, the HSC. The success and the es- 
thetic appeal of the HSC paradigm have 
been overwhelming. Subsequent studies in 
other tissues have almost inescapably been 
devoted to discovering an HSC equivalent. 
Discussions over the general applicability 
of the HSC paradigm as a stereotypic tem- 
plate have essentially not occurred. The 
examples given here present some of the 
best-studied models of mammalian tissue 
renewal. None of these match easily to the 
HSC paradigm. Rather, with some notable 
exceptions such as the muscle satellite cell 
or the bulge stem cell, the search for rare, 
hard-wired professional stem cells has 
been futile in most tissues. 

In a more open approach, the focus on 
the search for stem cells as a physical en- 
tity may need to be replaced by the search 
for stem cell function, operationally de- 
fined as the ability of an organ to replace 
lost tissue. An opportunistic experimental 
starting point should probably focus on 
those cell populations that exhibit pro- 
liferative activity during tissue renewal 
and on the origins of those cells. Stem 
cell function may indeed be embodied in 
hard-wired professional stem cells in some 
tissues. But stem cell function may also 
diffusely be contained within much larger 
populations of neutrally competing, un- 
differentiated cells. It may be executed by 
facultative stem cells, opportunistically re- 
cruited from committed or even from fully 
differentiated cell populations. Tissues 
might avoid the use of undifferentiated 
stem cells altogether, by allowing differen- 
tiated cells to proliferate directly and thus 
to replace lost tissue. In the end, there may 
be no general rules as to how tissues are 
renewed, as there is no end to the
inventive power of evolution.

CHEMISTRY

In the wake of collision

When atoms collide with metal surfaces, electron-hole 
pair excitations dissipate the adsorption energy 



By Harald Brune 

When a metal atom with thermal
kinetic energy hits a metal surface,
it adsorbs on it with a probability 
of one. Moreover, it sticks to its 
impact site and makes not even a 
single jump to neighboring lattice 
sites (1). The mechanism behind this observation
is not fully understood. On page 1346 of
this issue, Bünermann et al. study a simpler
system that can help to elucidate the metal/ 
metal case. In their study, energetic hydrogen 
atoms impact on a gold surface. Only a frac- 
tion of the H atoms adsorb on the surface 
(see the figure), but the reflected atoms still 
lose a substantial fraction of their incident ki- 
netic energy. The authors unequivocally iden- 
tify electron-hole pair excitation as the main 
dissipation mechanism in these collisions (2). 
Apart from its obvious model character, the 
H/metal system is important for fields as di- 
verse as heterogeneous catalysis, interstellar 
H₂ production, and H storage.

For the metal/metal case, field ion micros- 
copy studies have shown that the adsorption 
energy is dissipated instantaneously. Atoms 
deposited at low temperature (20 K) onto a 
close-packed single crystal surface populate 
two nonequivalent adsorption sites with 
equal probability, even when the two sites 
have slightly different binding energies (3). If 
the atoms were able to make even a single lateral
jump, the site with the higher binding energy would
be occupied more often than the weaker-binding
one. When the surface is heated to 45 K, 
thermal diffusion leads to the exclusive oc- 
cupation of the energetically more favorable 
site. The complete absence of transient mo- 
bility at low temperature requires the entire 
adsorption energy— on the order of several 
electron volts (eV)— to be dissipated instan- 
taneously. It is unlikely that collective lattice 
vibrations are fast enough to dissipate this 
energy. As in the H/metal system, electroni- 
cally nonadiabatic effects, such as electron- 
hole pair excitations, are thus likely to play 
the dominant role. 

In a binary collision of H and Au atoms, 
only 2% of the kinetic energy is transferred 
from H to Au. This is a result of the very
different masses of the two atoms. For the incident
kinetic energy chosen in Bünermann et al.’s
experiment, the expected energy loss from 
a binary ballistic collision on Au is 56 meV. 
However, the observed energy loss is almost 
20 times higher, and the energy-loss distri- 
bution is much broader. It is centered at 910 
meV and the maximum energy loss is larger 
than 2.0 eV, explaining why some atoms 
stick and therefore why this reaction takes 
place at all. 

Figure: Surface collisions. (A) When metal atoms collide with a
metal surface with very small incident kinetic energy, every
atom sticks. (B) For molecule-metal collisions, electron-hole
pair excitations are the main mechanism through
which adsorption energy is dissipated. (C) Hydrogen/metal
collisions are similar to metal/metal ones; if the
incident kinetic energy is small, every atom would probably
stick. At the higher energies used by Bünermann et al.,
only very few stick, making it possible to measure the
energy of the backscattered atoms and to determine how
much energy they have lost. The authors show that as for
molecule/metal collisions, electron-hole pair excitation is
the dominant mechanism for energy dissipation.

To illustrate the role of electron-hole pair
excitation, the authors compare the reaction
on a clean Au surface with one that has a
thick layer of solid Xe adsorbed onto it. On
Xe, the binary collision model gives an
energy loss of 83 meV, and the observed loss is
comparable (46 meV). Therefore, the H-Xe
collision can be treated as a ballistic collision,
but the H-Au collision clearly cannot.
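
The ballistic numbers quoted above can be checked with a short calculation. The sketch below assumes the simplest version of the binary collision model: a head-on elastic collision with a target atom at rest, for which the transferred fraction of kinetic energy is 4mM/(m + M)². The atomic masses are standard values, and the function and variable names are illustrative only; the model actually used by the authors may differ in detail.

```python
def max_transfer_fraction(m_projectile, m_target):
    """Fraction of kinetic energy passed to a resting target atom
    in a head-on elastic (ballistic) two-body collision."""
    return 4.0 * m_projectile * m_target / (m_projectile + m_target) ** 2

# Standard atomic masses in atomic mass units (assumed inputs).
M_H, M_AU, M_XE = 1.008, 196.97, 131.29

frac_au = max_transfer_fraction(M_H, M_AU)  # roughly 2%
frac_xe = max_transfer_fraction(M_H, M_XE)  # roughly 3%

# Incident energy (eV) implied by the quoted 56 meV ballistic loss on Au.
implied_incident_ev = 0.056 / frac_au

# Ballistic loss on Xe (meV) predicted from that implied incident energy.
predicted_xe_loss_mev = 1000.0 * implied_incident_ev * frac_xe

print(f"H -> Au transfer fraction: {frac_au:.3f}")
print(f"H -> Xe transfer fraction: {frac_xe:.3f}")
print(f"implied incident energy: {implied_incident_ev:.2f} eV")
print(f"predicted ballistic loss on Xe: {predicted_xe_loss_mev:.0f} meV")
```

Under these assumptions the Xe estimate lands within rounding of the quoted 83 meV, which suggests that both quoted ballistic losses follow from the mass ratio alone. The much larger loss observed on Au therefore has to come from another channel.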

Molecular dynamics calculations without 
electronic excitations essentially yield the 
ballistic loss spectrum, which is in disagreement
with the scattering data from Au(111).
Once electron-hole pair creation is included, 
the calculations give excellent agreement 
with experiment. Similar to the experiment, 
these simulations are very demanding, be- 
cause many trajectories have to be consid- 
ered on a high-dimensional potential energy 
surface. The beauty of H as adsorbate is that 
it reduces the complexity because there is 
only translational kinetic energy; no rota- 
tions, vibrations, or steric effects such as ori- 
entations of the molecular axes need to be 
considered. The theory uses the ab initio po- 
tential energy surface in classical molecular 
dynamics simulations and applies the known 
scheme to introduce electronic friction (4). It 
thus has no adjustable parameters. 

The interactions of molecules with metal 
surfaces are much more complicated because 
the molecules can dissociate, there might 
be activation barriers for this dissociation, 
and steric effects can play a role. However, 
energy dissipation in nondissociative mo- 
lecular adsorption is in a way simpler than 
in atomic adsorption (5). The molecule gets 
distorted when binding to the surface starts 
to become significant. For example, near a 
surface, the bond in a simple diatomic mol- 
ecule gets weakened and therefore stretched, 
thereby exciting the molecule’s vibrational 
degrees of freedom. From this excited state, 
the energy can very efficiently be transferred 
to electronically nonadiabatic behavior, such 
as electron-hole pair excitation in the metal. 

The central insight from the work of 
Bünermann et al. is that translational motion
can be damped in the same way as intramo- 
lecular excitations (6). This is important in 
the energy dissipation of hot reaction prod- 
ucts (7, 8). Electron-hole pair creation is also 
behind regular dry friction, and its effect can 
be suppressed by putting one of the sliding 
bodies into a superconducting state (9). Elec- 
tron-hole pair excitations control the time 
and energy scale of surface chemical reac- 
tions, obviously also for the most elementary 
one of H on Au(111).

CHEMISTRY

A kid’s guide to chemistry 

The classic chemistry set gets a 21st-century upgrade 



By Julia Fahrenkamp-Uppenbrink 

Chemistry sets used to be all the rage,
but today, children are more likely to 
be found at their computers or tablets 
than doing experiments. Furthermore, 
existing sets tend to present chemis- 
try as miraculous, without explaining 
the observations in any detail. A new take 
on the old staple is trying to change that. 
MEL Science’s chemistry set currently tar- 
gets the home-schooling market, providing 
a starter kit and 36 individual experiment 
sets intended for children aged 12 years and 
older. The experiments are sent out in packs 
of three over the course of 12 months, and 
can currently be shipped to addresses in the 
United States, United Kingdom, and Rus- 
sia. The kits were developed by a group of 
self-described “science geeks,” with the help 
of schoolchildren and teachers at a London 
secondary school to make sure that the
science is presented at an appropriate level.

For this review, my daughter and her
school friend first picked the “Colorful chem- 
istry” set. The set contains two experiments 
that were easy to perform and that worked 
exactly as expected. In one experiment, ad- 
dition of glycerol to a potassium perman- 
ganate solution changes its color gradually 
from purple to orange as the permanganate 
reacts with the glycerol. In the other, the thy- 
mol blue dye changes the color of a solution 
depending on its acidity. My daughter made 
the connection to an acid-base chart from 
her science class and was thrilled to have 
recreated it herself. 

For our second session, two other friends 
joined us, as word had got around at school 
about the cool new chemistry set being tested. 
This time, we made a tin “hedgehog” by plac- 
ing a zinc pellet in a tin chloride solution. We 
watched little spikes grow all over the pellet 
as the zinc reacted with the tin chloride and 
the tin precipitated from the solution. In the 
second experiment, we watched tin dendrites 
grow in tin chloride solution exposed to an 
electric current. The children were very im- 
pressed by the results. From start to finish, 
each set took about 1 hour to complete. 



MEL Chemistry 
Starter kit: $73.80, 
individual sets: $11.95 

https://melscience.com 







Each set comes with detailed, easy-to- 
follow instructions and brief explanations, 
including the main chemical equations 
that underlie the observations. The ones we 
tested were meticulously prepared and of 
high quality. I loved the level of attention to 
detail: For example, a paper clip provided for 
pricking the tops of reagent bottles comes 
in its own little bag, pre-bent to allow easy 
pricking. With only a few exceptions— such 
as calling one experiment “Magic liquid” and 
one set “Chemistry of monsters”— they are 
refreshingly free of silliness. 

The sets are designed to work with 
smartphones in a number of ways that are 
likely to make them fascinating for young 
people. For example, we had fun taking 
close-up pictures of the tin hedgehog with 
the smartphone macro lens provided in 
the starter kit. There is also a dedicated 
phone and tablet app that provides more 
detailed explanations for each experiment. 

The explanation for the tin set was detailed 
and well presented. The explanations for 
the colorful chemistry set were more lim- 
ited in the trial kit that we received, but 
when I followed up with the kit’s cre- 
ators, I learned that they have since been 
updated. 

The app also shows the chemical struc- 
tures of all reagents, including stereoviews 
of stick and space-filling models. A stereo- 
viewer is provided as part of the starter kit, 
and although it took some effort to get it 
to work, it did impress the young people. 
This functionality illustrates the ambition 
of the project, which goes well beyond a 
traditional chemistry set. 

At times during our sessions, I did 
feel slightly overwhelmed trying to use 
a phone, tablet, and stereoviewer while 
simultaneously supervising the experi- 
ments, but I quickly relaxed as I saw the 
children enjoying themselves. The app is a 
great resource for follow-up questions, and 
the reagent viewing can also be left to a 
later date. 

Having done two sets of experiments, I 
feel we have only just scratched the surface 
of what the set has to offer. My daughter 
and her rapidly growing group of chemis- 
try enthusiast friends are looking forward 
to exploring chemical erasers and the
zinc-carbon battery set next. I will leave the
chemical monsters to last.

EXHIBITION



The enchantress of numbers 

Two hundred years after her birth, a new exhibition 
explores the life of Ada Lovelace 



By Andrew Robinson 

Any visitor to the London Library
inevitably walks past a commemo- 
rative blue plaque attached to a 
neighboring house in which once 
lived the remarkable Ada, Count- 
ess of Lovelace (1815-1852). Ada 
Lovelace is well known for three reasons. 
She was the daughter of the Romantic 
poet Lord Byron, who separated 
from her mother when Ada was 
just 1 month old. (Despite his 
early abandonment, she ulti- 
mately chose to be buried next 
to Byron in the family tomb.) 

She was also the first person to 
discuss the concept of program- 
ming a computer: In the 1840s, 
she issued an extensive and far- 
sighted commentary on a cal- 
culating machine known as the 
Analytical Engine, created by 
the mathematician and inventor 
Charles Babbage. 

Those who are unfamiliar
with Lovelace’s biography may
recognize her name nonetheless:
In 1980, a high-level computer
programming language, Ada, 
was created for the U.S. Depart- 
ment of Defense. The language 
is still widely used today. With 
the recent establishment of Ada 
Lovelace Day on 13 October, she 
has become something of an 
iconic figure for those whose 
goal is to increase the profile of 
women in science, technology, 
and mathematics. 

For the bicentenary of her 
birth, the Science Museum in 
London, which holds Babbage’s 
Difference Engine and Analyti- 
cal Engine, is displaying a small 
but intriguing exhibition about Lovelace’s 
brief life and mathematical work. In addi- 
tion to a prototype of the Difference En- 
gine and the unfinished Analytical Engine, 
the exhibition includes a model of a
Jacquard loom. Whereas the punch cards of
the loom wove flowers and leaves, those
of the engine would weave algebraic pat- 
terns, she once observed. 

The exhibition also features two color 
portraits of Lovelace, each depicting her 
as a fashion-conscious young lady. She was 
not overly fond of either portrait; one of 
them prompted her to comment that her 
jaw looked large enough to write the word 
“mathematics” on it. 







Ada Lovelace
Science Museum, London 

Through 31 March 2016 







According to Lovelace, this portrait, painted in 1836, rendered her jaw large 
enough to write the word “mathematics” upon. 



Originals of her letters from the collec- 
tions of the British Library and the Bodle- 
ian Library appear alongside a digital 
display, which allows the visitor to follow 
the progress of her work and her interac- 
tions with Babbage, Michael Faraday, and 
others. 

Speaking of the Analytical Engine, she 
once pointed out that, because the engine 
generates numerical data, many people
might mistakenly assume that it is only
useful for arithmetic. On the contrary, 
she argued, “The engine can arrange and 
combine its numerical quantities exactly 
as if they were letters or any other general 
symbols; and in fact it might bring out its 
results in algebraical notation, were provi- 
sions made accordingly.” 

In a remark that presciently foreshad- 
ows the digitization of music, she goes on: 
“Supposing, for instance, that the funda- 
mental relations of pitched sounds in the 
science of harmony and of musi- 
cal composition were susceptible 
of such expression and adapta- 
tions, the engine might compose 
elaborate and scientific pieces of 
music of any degree of complex- 
ity or extent.” 

Several historians of comput- 
ing, including Doron Swade, one 
of the advisers to the exhibition, 
have observed that Lovelace had 
a rather high opinion of her own 
intellect (1). For example, in 1843
she informed Babbage that “the 
more I study, the more insatiable 
do I feel my genius for it to be.” 
Comparing herself with her fa- 
ther, she wrote: “I do not believe 
that my father was (or ever could 
have been) such a Poet as I shall 
be an Analyst (& Metaphysi- 
cian).” The exhibition organizers 
downplay this conceit, diplo- 
matically stating that “Her dis- 
tinctive mixture of mathematical 
analysis and unbridled imagina- 
tion brought unique insights into 
what the Analytical Engine could 
achieve.” 

Sadly, because Babbage never 
completed the engine, for lack 
of public funding, we shall never 
know what it might have pro- 
duced. And since Lovelace died 
from cancer in her 30s, her 
potential, too, went unfulfilled. But de- 
spite her untimely death, her legacy as a 
“prophet of the computer age” is very much 
alive today.

Edited by Jennifer Sills

Flying foxes face cull 
despite evidence 

FLYING FOXES (Pteropus niger, also known
as Mauritius fruit bats) are currently listed 
as Vulnerable by the World Conservation 
Union (IUCN) (1). Originally inhabitants of 
all three Mascarene Islands, the P. niger pop- 
ulation was driven extinct on Reunion and 
Rodrigues, leaving only the population on 
Mauritius ( 2 ) [although a few animals did 
stray from Mauritius to Reunion recently 
(1)]. Mauritius, meanwhile, has lost two 
of its three flying fox species as a result of 
habitat destruction and hunting by humans 
(2, 3). Yet, disregarding scientific evidence, 
the Parliament in Mauritius recently decided 
to cull thousands of the island’s remaining 
flying foxes, which consume fruit and nectar, 
in an effort to increase the profits of fruit 
producers (4). The IUCN warned that the 
cull would force an immediate reassessment 
of the species’ Red List Category, possibly 
even to Critically Endangered, damaging 
Mauritius’s reputation ( 3 ). The IUCN, Bat 
Conservation International, and others 
have called for an evidence-based decision 
instead, but so far in vain (3, 5). 

To justify the cull, the government is 
using exaggerated figures of population size 
and damage to fruits (3-5). The Honorable 
Minister has cited a population ranging 
from 90,000 (4) to more than 1 million fruit 
bats (6). Research by experts indicates that 
the population is only 50,000 (3, 4). If the 
government uses its own estimates instead 
of scientific evidence to determine the num- 
ber of fruit bats to cull, the targeted 20% 
could amount to 18,000 fruit bats (or more, 
if higher estimates are used). This would 
not be 20%, but closer to 40% of the actual 
population. Worse, the cull is scheduled to 
take place at the onset of the cyclonic season. 
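
The percentages in this paragraph can be verified with a few lines of arithmetic. The sketch below uses only the figures quoted in the letter; the variable names are illustrative.

```python
# Figures quoted in the letter.
government_estimate = 90_000   # lowest population figure cited by the Minister
expert_estimate = 50_000       # population indicated by expert research
cull_target = 0.20             # targeted share of the population

# Number of bats culled if the 20% target is applied to the government figure.
bats_culled = cull_target * government_estimate

# The same number expressed as a share of the expert-estimated population.
share_of_actual = bats_culled / expert_estimate

print(f"bats culled: {bats_culled:,.0f}")
print(f"share of expert-estimated population: {share_of_actual:.0%}")
```

Applied to the lowest government estimate, the 20% target removes 18,000 animals, or 36% of the population that experts believe exists. Any higher government estimate pushes that share further up.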

Other factors, including impacts of invasive
alien species (e.g., rats, parakeets, and
mynahs), destroy more fruits than flying
foxes (7), but are not targeted by the cull,
casting further doubts on its effectiveness.
Aside from elevating the species’ extinction
risks, the cull would exacerbate the already
critical situation of Mauritian biodiversity
(8), because flying foxes are the sole surviving
seed disseminator for several important
native tree species (9-11).

Mauritius should not allow its fruit
production and export industry to damage
threatened species and the environment.
The international community can help
encourage Mauritius to reinstate evidence-based
decisions and stop the cull.

F. B. Vincent Florens 

Faculty of Science, University of Mauritius, Reduit, 
Mauritius. E-mail: Vin.Florens@uom.ac.mu

LIFE IN SCIENCE

The Force of ethics awakens 



“Would creating an army of clones to battle droids be ethically acceptable?
As Chancellor Palpatine suggested to Anakin Skywalker,
could creating life or indefinitely prolonging life be considered
morally legitimate? If so, then why did the Jedi’s moral code strictly
prevent all forms of research on altering life itself and ban the acquisition
of such knowledge?” I pause to allow my students to absorb these questions.
They regard me, wide-eyed. 

Summoning Master Yoda, Obi-Wan Kenobi, or even— may the Force protect us!— 
the Emperor in a class on the ethics of biomedical research might at first sound a 
bit out of place. But upon closer look, the Jedi and the Sith could well be welcome 
there. Star Wars presents an almost continuous collection of life-related moral dilem- 
mas, each of them serving as a perfect starting point for discussions in a biomedical 
classroom. From massive cloning and species destruction or creation, to mind-control 
and knowledge being kept by (and for) a limited elite, Star Wars provides examples 
for almost all of the challenges we want students to consider. The vast majority of 
students are highly familiar with the popular culture phenomenon that is Star Wars, 
and they are motivated to actively engage in the debate process. 

With biomedical science moving so fast that it might sound like fiction, one of our 
missions is to prepare students— the future scientists— for the ethical challenges they 
might encounter in their career. Popular culture is just a tool we can use to approach 
such essential questions. Still, awakening the Force in the classroom might help our 
students avoid becoming seduced by the Dark Side. 

Matthieu J. Guitton

Marine Protected Areas
miss the boat 

IN THEIR POLICY Forum “Making waves: 
The science and politics of ocean protec- 
tion” (23 October, p. 382; Published online 
15 October 2015), J. Lubchenco and K. 
Grorud-Colvert discuss recent progress in 
establishing and enforcing Marine Protected 
Areas, in which extractive activities are 
illegal. We should make sure that the focus 
on Marine Protected Areas does not divert 
attention from other strategies of ocean 
protection. The major threats to the ocean 
are global warming, ocean acidification, pol- 
lution, illegal fishing, land-based runoff of 
sediments, and plastics (1, 2). Other threats 
include noise pollution and whale collisions 
resulting from ship traffic (3, 4). Marine 
Protected Areas provide absolutely no pro- 
tection from any of these threats. The major 
“threat” they address is legal fishing, which
is now well regulated in almost all of the
areas where large Marine Protected Areas
are being implemented (5); where fisheries are
not well regulated, the protections are unlikely
to be effectively enforced (5).

Ray Hilborn

TECHNICAL COMMENT
ABSTRACTS 

Comment on “Atlantic and Pacific 
multidecadal oscillations and Northern 
Hemisphere temperatures” 

S. Kravtsov, M. G. Wyatt, J. A. Curry, A. A. Tsonis

Steinman et al. (Reports, 27 February 2015, 
p. 988) argue that appropriately rescaled
multimodel ensemble-mean time series
provide an unbiased estimate of the 
forced climate response in individual 
model simulations. However, their 
procedure for demonstrating the validity 
of this assertion is flawed, and the residual 
intrinsic variability so defined is in fact 
dominated by the actual forced response 
of individual models. 

Full text at http://dx.doi.org/10.1126/science.aab3570

Response to Comment on “Atlantic and 
Pacific multidecadal oscillations and 
Northern Hemisphere temperatures” 

B.A. Steinman, L. M. Frankcombe, M. E. 
Mann, S. K. Miller, M. H. England 

Kravtsov et al. claim that we incorrectly 
assess the statistical independence of 
simulated samples of internal climate 
variability and that we underestimate 
uncertainty in our calculations of observed 
internal variability. Their analysis is 
fundamentally flawed, owing to the use of 
model ensembles with too few realizations 
and the fact that no one model can 
adequately represent the forced signal. 

Full text at http://dx.doi.org/10.1126/science.aac5208

TECHNICAL COMMENT



CLIMATE CHANGE 

Comment on “Atlantic and Pacific 
multidecadal oscillations and 
Northern Hemisphere temperatures” 

S. Kravtsov, 1 * M. G. Wyatt, 2 J. A. Curry, 3 A. A. Tsonis 1 

Steinman et al. (Reports, 27 February 2015, p. 988) argue that appropriately rescaled 
multimodel ensemble-mean time series provide an unbiased estimate of the forced climate 
response in individual model simulations. However, their procedure for demonstrating the 
validity of this assertion is flawed, and the residual intrinsic variability so defined is in fact 
dominated by the actual forced response of individual models. 



The central result of Steinman et al.’s analysis
(1) is the demonstration of an apparent
consistency among the responses of differ- 
ent models to variable forcing in the 20th- 
century climate simulations. In particular, 
they claim that the regional multimodel ensemble-mean
time series defines the universal forced
signal, which can be linearly rescaled to provide 
unbiased estimates of the regional forced re- 
sponses for individual models. Such a consisten- 
cy is surprising because the models have different 
physical parameterizations and the simulations 
may use different forcing subsets. If their claim 
were true, it would add much confidence to the 
authors’ semi-empirical attribution of the ob- 
served multidecadal climate variability to the 
forced and intrinsic sources. However, the implied 
uniqueness of the forced signal defined by their 
regional regression method is an artifact of their 
analysis procedure, and the actual uncertainty of 
the semi-empirical estimates of the observed multi- 
decadal intrinsic variability is much larger than 
these authors have inferred. 

Consider M time series of length T, corresponding
to M different climate simulations:
$x_m^{(t)}$; $m = 1, \ldots, M$; $t = 1, \ldots, T$. Let the bar denote
averaging across the time dimension (t) and
square brackets denote averaging across the
ensemble member dimension (m). For example,
the time mean of each ensemble member $\bar{x}_m$ and
the ensemble average time series $[x^{(t)}]$ are
defined as follows

$$\bar{x}_m = \frac{1}{T}\sum_{t=1}^{T} x_m^{(t)} \qquad (1)$$

$$[x^{(t)}] = \frac{1}{M}\sum_{m=1}^{M} x_m^{(t)} \qquad (2)$$

Consider a decomposition of $x_m^{(t)}$ into the forced
signal $f_m^{(t)}$ and residual intrinsic variability $\varepsilon_m^{(t)}$

$$x_m^{(t)} = f_m^{(t)} + \varepsilon_m^{(t)} \qquad (3)$$

Without loss of generality, we can assume
$\bar{x}_m = \bar{f}_m = 0$, hence $\bar{\varepsilon}_m = 0$. If the estimated
forced signal $f_m^{(t)}$ is unbiased, then the time series
$\varepsilon_{m_1}^{(t)}$ and $\varepsilon_{m_2}^{(t)}$ of residual intrinsic variability in any
pair of simulations $m_1$ and $m_2$ must be uncorrelated
(independent). Furthermore, if the distribution
of $\varepsilon_m^{(t)}$ has mean 0 and variance $\sigma^2$, the
ensemble mean residual time series $[\varepsilon^{(t)}]$ will have
a distribution with mean 0 and variance $\sigma^2/M$.
Hence, one can quantitatively assess the statistical
independence of different realizations of simulated
intrinsic variability by comparing the actual
dispersion $\overline{[\varepsilon]^2}$ of the ensemble mean time series
$[\varepsilon^{(t)}]$ with its theoretical prediction $[\overline{\varepsilon^2}]/M$, where
we estimated $\sigma^2 \approx [\overline{\varepsilon^2}]$. Large values of $\overline{[\varepsilon]^2}$ would
indicate that the assumption of statistical independence
between different realizations of intrinsic
variability $\varepsilon_m^{(t)}$ is violated due to biases in the estimated
forced signal $f_m^{(t)}$, so that at least a portion
of the common true forced signal manifests in
the estimated “intrinsic” residuals $\varepsilon_m^{(t)}$.
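
The independence test described above can be illustrated with synthetic data. The following sketch is not taken from the Comment: it draws Gaussian residuals with Python's standard `random` module, and the ensemble size, series length, and amplitude of the shared component are arbitrary assumptions chosen only to make the effect visible.

```python
import random
from statistics import fmean

def ensemble_mean_dispersion(residuals):
    """Time-mean square of the ensemble-mean residual series."""
    n_members, n_times = len(residuals), len(residuals[0])
    ens_mean = [fmean(residuals[m][t] for m in range(n_members))
                for t in range(n_times)]
    return fmean(v * v for v in ens_mean)

def mean_member_variance(residuals):
    """Ensemble average of each member's time-mean square (estimate of sigma^2)."""
    return fmean(fmean(v * v for v in member) for member in residuals)

rng = random.Random(0)
M, T = 20, 2000  # hypothetical ensemble size and series length

# Case 1: truly independent residuals with unit variance.
independent = [[rng.gauss(0.0, 1.0) for _ in range(T)] for _ in range(M)]

# Case 2: the same residuals plus a shared component, mimicking a part of
# the forced signal that leaked into every member's "intrinsic" residual.
shared = [rng.gauss(0.0, 0.5) for _ in range(T)]
contaminated = [[e + s for e, s in zip(member, shared)] for member in independent]

# Ratio of actual dispersion to the sigma^2 / M prediction.
ratio_independent = (ensemble_mean_dispersion(independent)
                     / (mean_member_variance(independent) / M))
ratio_contaminated = (ensemble_mean_dispersion(contaminated)
                      / (mean_member_variance(contaminated) / M))

print(f"independent residuals:  ratio = {ratio_independent:.2f}")
print(f"contaminated residuals: ratio = {ratio_contaminated:.2f}")
```

For independent residuals the ratio stays close to 1, whereas a shared component drives it well above 1. That is the signature of a biased forced-signal estimate that the test is meant to detect.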

Steinman et al. considered, among others, the 
following two methods for estimating the forced 
signal, both based on the multimodel ensemble 
mean time series 

$$f_m^{(t)} = [x^{(t)}] \qquad (4A)$$

$$f_m^{(t)} = a_m [x^{(t)}] \qquad (4B)$$

The differencing method (Eqs. 3 and 4A) simply
identifies the forced signal with the multimodel
ensemble mean $[x^{(t)}]$. The regression method (Eqs.
3 and 4B) rescales the first-guess forced signal
$[x^{(t)}]$ for a given simulation by finding $a_m$ via
least squares to minimize $\overline{\varepsilon_m^2}$ in Eq. 3.



Steinman et al. further claimed that both of 
these methods provided independent realizations 
of residual intrinsic variability in climate-model 
simulations, based on the fact that the resulting 
variance $\overline{[\varepsilon]^2}$ of the ensemble mean residual time
series was much smaller than the theoretical value
of $[\overline{\varepsilon^2}]/M$. However, it is easy to show that, due
to the choice of forcing derived using either Eq.
4A or Eq. 4B, this ensemble mean residual time
series is identically zero

$$[\varepsilon^{(t)}] \equiv 0; \quad t = 1, \ldots, T \qquad (5)$$

and so is its variance, $\overline{[\varepsilon]^2} = 0$. Hence, the
extreme smallness of the dispersion of ensemble
average intrinsic variability attributed in (1) to 
the statistical independence of its different real- 
izations is actually an artifact of the algebraic 
constraint (Eq. 5) [see (2-5)]. This flaw does 
not mean that the residuals are necessarily cor- 
related (not independent), but a different test is 
required to determine that. 
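
The algebraic constraint (Eq. 5) can be checked numerically. The sketch below is an illustration under stated assumptions rather than the procedure used in either paper: the synthetic simulations, the linear forced trend, and the no-intercept least-squares fit for the scaling factors are all assumed here.

```python
import numpy as np

rng = np.random.default_rng(1)
M, T = 12, 150
t = np.arange(T)
forced = 0.01 * t                       # hypothetical common forced trend

# Synthetic simulations with different sensitivities plus noise.
x = rng.uniform(0.5, 1.5, size=(M, 1)) * forced + rng.normal(0.0, 0.2, size=(M, T))

xbar = x.mean(axis=0)                   # multimodel ensemble mean [x]

# Eq. 4A (differencing): the forced signal is the ensemble mean itself.
res_a = x - xbar

# Eq. 4B (regression): least-squares scaling with no intercept (assumption),
# a_m = sum_t x_m [x] / sum_t [x]^2.
a = (x @ xbar) / (xbar @ xbar)
res_b = x - a[:, None] * xbar

mean_res_a = res_a.mean(axis=0)         # ensemble mean residual, method 4A
mean_res_b = res_b.mean(axis=0)         # ensemble mean residual, method 4B
```

In this setup the scaling factors average to exactly 1, because the mean over m of sum_t x_m [x] equals sum_t [x]^2. Both ensemble mean residual series therefore vanish to floating-point precision, whether or not the individual residuals are independent.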

We now show directly that the regional regression
approach (1) of defining the forced signal
leads to correlated samples of residual intrinsic
variability in the individual-model ensembles
(subensembles of simulations using a single
model with fixed physics package and an identical
forcing history). For these subensembles, it
is the expression (Eq. 4A) that naturally gives an
unbiased estimate of the forced variability. We
considered 18 such subensembles from the Coupled
Model Intercomparison Project Phase 5
(CMIP5) models with four or more 20th-century
simulations (6), totaling 116 individual simulations
out of the 170 available simulations. The
multimodel ensemble mean based on these 116 
simulations is nearly identical to the one com- 
puted using all of the available 170 simulations. 
We defined two alternative sets of the model- 
simulated intrinsic variability. In method A, we 
formed realizations of intrinsic variability by 
subtracting the 5-year low-pass-filtered ensem- 
ble mean of each model from this model’s indi- 
vidual simulations (i.e., Eq. 4A applied separately 
to individual model ensembles). The second set 
(method B) defined the residual intrinsic varia- 
bility using the forced signal estimated from re- 
gional multimodel regression (1) (i.e., Eq. 4B 
applied to the whole ensemble of 116 simulations). 

To quantify independence of different realiza- 
tions of intrinsic variability in the individual- 
model ensembles, we introduced an ensemble 
correlation measure C by summing positive cor- 
relations among all possible pairs of an individ- 
ual model’s M ensemble members 

[Eq. 6: C defined as a sum over pairs of ensemble members (m > l); expression garbled in extraction] (6)

where H(x) is the Heaviside step function (7); 
the quantity C ranges from 0 (no positive cor- 
relations between individual ensemble mem- 
bers) to 1 (all ensemble members are perfectly 
correlated). The correlation measure (Eq. 6) was 
computed for raw and low-pass-filtered intrin- 
sic variability defined using methods A and B 
[Fig. 1, A to C shows results for the Geophysical 
Fluid Dynamics Laboratory (GFDL) CM3 model;
see (8)]. Method A produces intrinsic variability
with C values well within the range expected from
random uncorrelated red-noise samples generated
using an autoregressive model of order 3 (AR-3) (9).
In contrast, Steinman et al.'s method B results in
samples that are significantly correlated due to their
systematic difference from the true forced signal.
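
A pairwise correlation measure of this kind can be sketched as follows. The normalization used here (the mean over all member pairs of c H(c), where c is the pairwise correlation) is an assumption chosen so that the measure runs from 0 to 1 as described in the text, and it may differ from the exact form of Eq. 6. The function name and the synthetic inputs are hypothetical.

```python
import numpy as np


def correlation_measure(residuals):
    """Mean over member pairs of c * H(c), with c the pairwise correlation."""
    c = np.corrcoef(residuals)                # M x M correlation matrix
    iu = np.triu_indices(c.shape[0], k=1)     # indices of all pairs l < m
    pair_c = c[iu]
    # The Heaviside step zeroes out negative correlations.
    return np.mean(pair_c * np.heaviside(pair_c, 0.0))


rng = np.random.default_rng(2)
noise = rng.normal(size=(5, 400))                    # five independent members
shared = np.sin(np.linspace(0.0, 4.0 * np.pi, 400))  # common residual "bias"

c_independent = correlation_measure(noise)
c_biased = correlation_measure(noise + shared)
```

With these synthetic inputs, independent members give a value close to 0, while members that share a common component give a clearly larger value.
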

We then used 18 versions of the forced signal,
estimated by the unbiased method A, to isolate 
intrinsic variability in observed surface temper- 
atures via Eq. 3 and Eq. 4B (Fig. 1, D to F). The 
spread among the 18 estimates of intrinsic var- 
iability in observations is much larger than the 
tight bootstrap-based error bounds on the semi- 
empirical estimates of the observed intrinsic 
variability in figure 3 in (1). Hence, the actual 
uncertainty of the semi-empirical attribution 
by SMM is also much larger (10), thereby pre- 
venting any clear inferences about the cause of 
the “false pause” in the global warming (11, 12). 

REFERENCES AND NOTES 

1. B. A. Steinman, M. E. Mann, S. K. Miller, Science 347, 988-991 
(2015). 

2. The standard deviation of intrinsic variability computed in 
Steinman et al. (1) is small, but not exactly zero because of 
their using a data adaptive low-pass filter before averaging 
intrinsic variability among different simulations. 

3. Steinman et al. also used weighted ensemble means to define a 
version of their model-based forced signal. In this case, the con- 
straint (Eq. 5) would not be exact but would still be approximately 
valid, because the weighted and nonweighted estimates of the 
forced signal are in fact very close (not shown here). 

4. Comment (3) above also applies to a possible variation of the 
regression method (Eq. 4B) in which, instead of scaling each 
individual simulation by its own factor a m , one would estimate 
and use the single scaling factor for all simulations of each 
model; this scaling factor can be defined, for example, as the 
ensemble mean of a m estimates computed for individual 
simulations of a given model. 

5. One way to try to alleviate constraint (Eq. 5) would be to 
estimate the forced signal for a given subset of models using 
the ensemble mean time series of the complement subset of 
models. However, this would only be effective if the sizes of 
these two subsets are comparable. Otherwise, the multimodel 
averaging over the much larger complement subset of models 
would also be very close to the all-model ensemble mean, and 
the algebraic constraint (Eq. 5) would still approximately hold. 

6. There are 13 models with four or more 20th-century 
simulations in the CMIP5 data set, but considering separately 
the ensembles of the Goddard Institute for Space Studies 
(GISS) models with different physics packages makes up 18 
independent ensembles. 

7. The Heaviside step function is used here merely to streamline 
the mathematical notations in the multiple correlation measure 
(Eq. 6) by zeroing out negative terms in the sum of 
correlations, leaving positive terms unchanged. 

8. Other models exhibit a similar behavior; see the corresponding 
images at https://pantherfile.uwm.edu/kravtsov/www/downloads/KWCT2015/TIFF_FILES.

9. If one does not divide the large ensembles of the GISS models 
into the subensembles with different physics (6), the 
correlation-measure diagnosis does identify the dependency 
between the model realizations, because the true forced 
responses in these versions of the model are different from the 
grand ensemble mean response, and similar long-term biases 
across the same-physics model simulations ensue. 

10. The bootstrap resampling used in (1) is equivalent to considering 
subensembles of about two-thirds of independent models (or 
simulations), thus effectively averaging out the intramodel un- 
certainty of the forced response emphasized in our Fig. 1, D to F. 

11. This is exacerbated further by the unfortunate linear extrap- 
olation of the CMIP5 runs from 2005 to 2012 used in (1) to 
estimate recent intrinsic trends. 

12. B. Rajaratnam, J. Romano, M. Tsiang, N. S. Diffenbaugh,
Clim. Change 133, 129-140 (2015).

Fig. 1. Intrinsic variability in the 20th-century model simulations with four or more ensemble
members identified using two different methods for estimating the forced signal: the classical
subtraction of the individual-model ensemble mean (method A) and the multimodel regional
regression method (1) (method B). (A to C) The correlation measure (Eq. 6) of statistical
independence between multiple realizations of the GFDL CM3 model (five realizations) for
(A) Atlantic Multidecadal Oscillation (AMO), (B) Pacific Multidecadal Oscillation (PMO), and
(C) Northern Hemisphere Multidecadal Oscillation (NMO) indices; these correlations were computed
for running-mean low-pass-filtered residual time series (which characterize intrinsic variability)
and are plotted here against the averaging window size. A low correlation measure indicates
statistical independence of intrinsic residuals. Dashed lines show the 99th percentile of the
correlation measure based on 1000 simulations of the corresponding AR-3 red-noise model.
(D to F) Estimates of the observed multidecadal intrinsic variability for (D) AMO, (E) PMO, and
(F) NMO. The semi-empirical estimates (thin black lines) were computed as in (1) based on the
forced signals obtained using method A for each of the 18 model ensembles considered, with the
heavy red line indicating the average over these individual estimates. Additional heavy lines
(see legend) are for results based on linear detrending. The distance between the black dashed
lines in each plot shows the 95th percentile of the standard deviations for multidecadal
intrinsic variability estimated using method A for each of 116 simulations considered.

CLIMATE CHANGE

Response to Comment on 
“Atlantic and Pacific multidecadal 
oscillations and Northern 
Hemisphere temperatures” 

B. A. Steinman,1*† L. M. Frankcombe,2† M. E. Mann,3 S. K. Miller,3 M. H. England2

Kravtsov et al. claim that we incorrectly assess the statistical independence of simulated 
samples of internal climate variability and that we underestimate uncertainty in our 
calculations of observed internal variability. Their analysis is fundamentally flawed, owing 
to the use of model ensembles with too few realizations and the fact that no one model can 
adequately represent the forced signal. 



Kravtsov et al. (1) wrongly claim (i) that
our assertion of statistical independence 
among estimates of the internal varia- 
bility in regional temperature change in 
climate models (2) is an artifact of a flawed 
procedure. They further suggest (ii) that robust 
assessments of simulated internal variability can- 
not rely on a multimodel ensemble mean (MMEM) 
that differs from the true forced signal of the in- 
dividual models, because the residuals of the two 
forced signals masquerade as low-frequency in- 
ternal variability, which leads to correlation among 
ensemble members. Finally, they claim (iii) that 
we substantially underestimate the uncertainty 
in the semi-empirical estimate of internal varia- 
bility derived using the MMEM to approximate 
the forced signal. 

Regarding their first point, Kravtsov et al. 
assert (in their second reference/note) that the 
standard deviation of the mean of internal var- 
iability is not exactly zero only because the data 
were filtered before analysis [figure 2 and figures 
S2 to S4 in (2)]. This is true, however, only if the 
full ensemble of realizations, N, is used in the 
calculation. If (N - 1) realizations are instead used,
as is the case in our analysis, the standard devia- 
tion of the mean is not zero (regardless of whether 
the data are filtered beforehand) (3). Our method 
for assessing statistical independence is valid. 

Regarding their second point, Kravtsov et al. 
claim that internal variability calculated using the 
regional regression method and a scaled MMEM 
introduces errors because each model has a different
forced response. Instead, they use a single-
model ensemble mean (SMEM) for models with 
four or more historical runs and show that the 
internal variability calculated in this manner has 
a lower variance and lower intramodel correla- 
tion than that determined using a MMEM [figure 
1, A to F, in (1)]. They assert that this occurs be- 
cause the difference between the MMEM and the 
true forced signal for individual models introdu- 
ces extra internal variability at low frequencies. 
Although this could be true in principle, this 
point is irrelevant because the ensembles of sim- 
ulated internal variability determined using re- 
gional regression nevertheless universally satisfy 
the requirements for statistical independence 
[figure 2 in (2)]. 

Moreover, we show that more than four (in- 
deed, more than 10) ensemble members are re- 
quired for a robust estimate of the forced signal 
from a SMEM, and thus the lower variance of the 
internal variability estimates is due to the small 
ensemble size, which leads to the removal of too 
much of the internal variability. We demonstrate 
this point using synthetic autoregressive-1 time 
series (Fig. 1, A and B), each with the same forced 
signal, and a different realization of red noise (4). 
We divide the 160-member ensemble into subsets 
of four- or ten-member ensembles, which were 
smoothed using a 5-year low-pass filter. Using the 
code provided by Kravtsov et al., we show that the
application of small ensemble SMEMs results in 
lower variance of internal variability (Fig. 1A), 
even though in this case we know that the large 
ensemble yields a far more accurate estimate of 
the forced signal (Fig. 1B). This idealized example
reveals that the higher variance obtained when 
using the MMEM relative to using the SMEM is in 
fact due to the removal of too much internal 
variability when using the SMEM. 
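
The effect of ensemble size on the retained variance can be illustrated with a simplified synthetic experiment. This is a sketch under stated assumptions, not a reproduction of Fig. 1A: the AR(1) coefficient, the linear forced signal, and the series length are hypothetical, and the 5-year low-pass filtering step is omitted.

```python
import numpy as np

rng = np.random.default_rng(3)
N, T, phi = 160, 500, 0.6     # phi is a hypothetical AR(1) coefficient

# AR(1) red noise for each member: n[k] = phi * n[k-1] + white noise.
white = rng.normal(size=(N, T))
noise = np.zeros((N, T))
noise[:, 0] = white[:, 0]
for k in range(1, T):
    noise[:, k] = phi * noise[:, k - 1] + white[:, k]

forced = np.linspace(0.0, 1.0, T)   # hypothetical common forced signal
x = forced + noise                  # synthetic "simulations"


def retained_fraction(sims, true_noise, group_size):
    """Variance of (member - group mean) relative to the true noise variance."""
    groups = sims.reshape(-1, group_size, sims.shape[1])
    residual = groups - groups.mean(axis=1, keepdims=True)
    return residual.var() / true_noise.var()


fractions = {m: retained_fraction(x, noise, m) for m in (4, 10, 160)}
```

For independent members, subtracting the mean of an M-member ensemble is expected to leave a fraction of about 1 - 1/M of the internal variance: roughly 0.75 for four members, 0.90 for ten, and above 0.99 for 160. This matches the direction of the effect described above.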

Regarding their third point, Kravtsov et al. 
claim that there is a wide range of possible semi-empirical
estimates of observed internal variability
resulting from the range of possible SMEM
estimates of the forced signal. They attempt to 
assess the uncertainty in the Atlantic Multi- 
decadal Oscillation (AMO), Pacific Multidecadal 
Oscillation (PMO), and Northern Hemisphere 
Multidecadal Oscillation (NMO) by applying the 
mean of individual model ensembles with four 
or more realizations and claim that the resulting 
spread of these separate estimates defines un- 
certainty inherent in the regional regression 
method (5). However, as we have shown, none 
of these SMEMs is in itself a robust estimate of 
the forced signal, because none of the models 
have enough ensemble members for suitable 
cancellation of the different realizations of 
internal variability [which also explains the 
narrow 2σ range in their figure 1, G to I (1)].
More importantly, it is unreasonable to expect 
an individual model to have a better represen- 
tation of the forced signal than the multimodel 
ensemble. In fact, the spread in the internal 
variability estimates of Kravtsov et al. [figure 1, 
G to I in (1)] supports this assertion, indicating 
that regional regression-based estimates of in- 
ternal variability and their uncertainties are 
intrinsically dependent on the choice of the forced 
signal and therefore that only robust estimates 
derived from a large number of models or real- 
izations should be used for this assessment. 
The MMEMs from the Coupled Model Intercomparison
Project Phase 5 [CMIP5-All (all models)
and CMIP5-AIE (models that include the
first and second aerosol indirect effects)] applied 
in our study fulfill these requirements, whereas 
the SMEMs do not. By this rationale, the CMIP5- 
All mean provides the best overall estimate of 
the forced signal, with an uncertainty that can be 
estimated (among other methods) using boot- 
strap resampling [figure 3C in (2)]. 

Although the use of SMEMs based on a small 
number of realizations is an inherently flawed 
method, we nonetheless show that both the 
MMEM and SMEM regression-based methods 
provide better estimates of the internal signal 
than simple linear detrending, which produces a 
large overestimation of the variance of the in- 
ternal variability, especially in recent decades [ma- 
genta curves in Fig. 1, A and C, and figure 1, G 
to I in (1)]. Indeed, this is one of the key points of 
our original Report and is a principal focus of 
Frankcombe et al. (6). This recent study concludes 
that linear detrending introduces large biases in 
both the amplitude and phase of the internal 
variability and that regression-based approaches 
that rely on large historical simulation ensembles 
to estimate the forced signal produce less biased 
estimates of internal variability. 

The use of different MMEM and large ensem- 
ble SMEM estimates of the forced series (5) pro- 
duces internal variability trends that are generally 
consistent [see figure 3 and figure S6 in (2)]. For 
example, the AMO, PMO, and NMO behavior over 
the most recent two decades are in each case 
largely similar to one another (e.g., in the most 
recent decade: PMO decreasing, NMO decreasing, 
AMO flat) (7) and inconsistent with estimates of 

Fig. 1. Estimates of AMO variance based on ensemble size and smoothing
window. (A) The variance of internal variability in synthetic AMO time series
produced using the same forced signal and different realizations of red noise.
The mean variance (solid lines) and 2σ range (dashed lines) of the large
ensemble (red, 160 members) and small ensembles (green, 10 members; blue,
4 members) are shown as a function of the time-averaging window size. The
small ensembles consist of random groupings taken from the large ensemble.
Also shown are the variance of the original synthetic time series (not including
the forced signal; black, almost coinciding with the red curve) and the variance
obtained after linear detrending (magenta). (B) Individual estimates of the
forced signal from the large and small ensemble means [colors as in (A)]
shown with the true forced signal (black). Note the extremely high level of
agreement between the large ensemble mean and the true forced signal
relative to that of the small ensemble means. The synthetic time series
(including the forced signal) are shown in gray. (C) The variance of semi-empirical
estimates of AMO variability from the 18 CMIP5 models considered
in Kravtsov et al. using multimodel ensemble mean regression (red), individual
model ensemble mean subtraction (blue), and linear detrending (magenta).
Ensemble mean variance (solid lines) and 2σ ranges (dashed lines) are shown
as a function of the time-averaging window size.



internal variability derived from simple statistical 
methods (i.e., detrending, with the exception of, 
perhaps, the PMO). Furthermore, the principal 
conclusions of Steinman et al. (2) (regarding
the recent slowdown in surface warming) have 
been supported by at least eight other promi- 
nent studies (8-16), of which the most recent 
(15) uses a semi-empirical method (in which 
the forced signal is estimated using a MMEM) 
that is very similar to the target region regres- 
sion method of Steinman et al. (2). 

In short, we find no merit to the criticisms of
Kravtsov et al. We once again emphasize that the
linear detrending procedure used in their past
work (16) leads to extremely biased estimates of
internal variability and should not be employed.
Our regression-based approach (2, 6), by contrast,
yields faithful estimates of the internal variability.

Predicting protein
quaternary structure 

Ahnert et al., p. 1331 




IN SCIENCE JOURNALS 



Edited by Stella Hurtley 




Colored scanning electron micrograph of an osteosarcoma cancer cell 



CANCER 

Out-RANKing osteosarcoma 

Osteosarcoma, the most common primary bone cancer, can 
be difficult to treat, especially in patients with metastatic 
disease. Chen et al. developed genetically engineered mouse
models of osteosarcoma and used them to demonstrate
that receptor activator of nuclear factor κB ligand (RANKL)
signaling contributes to the progression of this disease. 
Furthermore, denosumab, an antibody against RANKL 
already used in patients with other bone diseases, inhibited 
osteosarcoma in mouse models and so is a viable candidate 
for future testing in human patients. — YN 

Sci. Transl. Med. 7, 317ra197 (2015).

CANCER IMMUNOTHERAPY

Low mutation rate okay 
for T cells 

Cancers that tend to have high 
numbers of mutations, such 
as melanoma and smoking- 
induced lung cancer, respond 
well to immunotherapies, 
whereas those with fewer 
mutations, such as pancre- 
atic cancer, do not. Tran et al. 
searched for tumor mutation- 
reactive T cells in 10 patients 
with metastatic gastrointestinal 
cancers, which have relatively 
low mutation burdens, and 
discovered that 9 out of 10 
harbored such cells. T cells 
from one patient recognized 
a mutation common to many 
types of cancers. Engineering 
T cells to express this par- 
ticular mutation-reactive T cell 
receptor may extend adoptive 
cell immunotherapy to a larger 
pool of patients than previously 
anticipated. — KLM 

Science, this issue p. 1387 

AGING 

Aging: All in the head- 
and the gut 

The effects of hypoxia and 
caloric restriction, both of 
which extend life span in 
Caenorhabditis elegans, 
converge on the activation 
of an enzyme in cells of the 
intestine. Leiser et al. show 
that the life-extending effects 
of hypoxia begin in neurons 
with transcriptional activation 
by hypoxia-inducible factor-1 
and increased serotonergic 
signaling. These effects lead to 
increased production of flavin- 
containing monooxygenase-2 
(FMO-2) in the intestine, which 



increased longevity. Finding 
the relevant targets of FMO-2, 
which also accumulates in 
mammals under conditions 
that promote longevity, may 
elucidate further mechanisms 
that promote healthy aging. 

- LBR 

Science, this issue p. 1375 



ICE SHEETS 

Shrinking shelf and 
faster flow 

Zachariae Isstrom, a large glacier 
in northeast Greenland, began 
a rapid retreat after detaching 
from a stabilizing sill in the late 
1990s. Mouginot et al. report
that between 2002 and 2014, 
the area covered by the glacier’s 
ice shelf shrank by 95%; since 
1999, the glacier’s flow rate has 
nearly doubled; and its accel- 
eration increased threefold in 
the fall of 2012. These dramatic 
changes appear to be the result 
of a combination of warmer air 
and ocean temperatures and the 
topography of the ocean floor 
at the head of the glacier. Rising 
sea levels should continue to 
destabilize the marine portion of 
Zachariae Isstrom for decades. 

— HJS 

Science, this issue p. 1357 




Greenland’s ice 
flows over the 
varied underlying 
terrain near the coast 

SOCIAL BEHAVIOR

From faithfulness to 
spatial memory 

Natural selection shapes traits 
to be as adaptive as possible for 
a given environment. Okhovat 
et al. show that such varying 
selection also shapes varia- 
tion in social behavior and the 
brain (see the Perspective by 
Robinson). Variation in male 
fidelity among monogamous 
prairie voles reflects differences 
in the expression, regulation, 
and epigenetic status of a vaso- 
pressin receptor that functions 
in spatial memory. Thus, the 
trade-offs males face between 
fidelity and infidelity may pro- 
mote heritable variation that is 
important for spatial memory 
and maintain variation in their 
levels of fidelity. - SNV 

Science, this issue p. 1371; 

see also p. 1310 



CANCER 

Growing blood vessels 
in gliomas 

Aggressive gliomas have a high 
density of abnormal blood ves- 
sels that enables tumor growth 
and damages the brain. Zhang 
et al. analyzed patient data and
correlated increased levels of a 
secreted factor called pleiotro- 
phin with more aggressive grades 
of glioma and decreased survival. 
When implanted in mice, glioma 
cells that released pleiotrophin 
formed larger tumors with more 
blood vessels. Mice developed 
smaller gliomas and survived lon- 
ger when treated with inhibitors 
of ALK, a receptor for pleiotro- 
phin. -WW 

Sci. Signal. 8, ra125 (2015).



GEOPHYSICS 

A mysterious mid-mantle 
slowdown 

The viscosity of Earth’s deep 
interior plays a key role in mediat- 
ing plate tectonics. Rudolph et 
al. combined several geophysical 
data sets to model the viscosity 
of the mantle. Mantle viscosity 
abruptly increases below 1000 
km. The increase could explain 



the stalling of subducting slabs 
and the deflections of hot upwell- 
ing plumes around this depth. 
Although the viscosity increase 
explains some recent unexpected 
observations, the origin of the 
jump itself remains a mystery. 
-BG 

Science, this issue p. 1349 



SUPERCONDUCTIVITY 

Locking the spins in a 
superconductor 

In Cooper pairs— pairs of elec- 
trons responsible for the exotic 
properties of superconduc- 
tors— the two electrons’ spins 
typically point in opposite direc- 
tions. A strong-enough external 
magnetic field will destroy 
superconductivity by making 
the spins point in the same 
direction. Lu et al. observed a
two-dimensional superconducting
state in the material MoS₂
that was surprisingly immune 
to a magnetic field applied in 
the plane of the sample (see the 
Perspective by Suderow). The 
band structure of MoS₂ and its
spin-orbit coupling conspired 
to create an effective magnetic 
field that reinforced the electron 
pairing, with spins aligned per- 
pendicular to the sample. — JS 

Science, this issue p. 1353; 

see also p. 1316 



COGNITIVE SCIENCE 

Handwritten characters 
drawn by a model 

Not only do children learn 
effortlessly, they do so quickly 
and with a remarkable ability 
to use what they have learned 
as the raw material for creating 
new stuff. Lake et al. describe
a computational model that 
learns in a similar fashion and 
does so better than current 
deep learning algorithms. The 
model classifies, parses, and 
recreates handwritten char- 
acters, and can generate new 
letters of the alphabet that look 
“right” as judged by Turing-like 
tests of the model’s output 
in comparison to what real 
humans produce. — GJC 

Science, this issue p. 1332 




Planting marsh grasses 
in a clumped, rather than 
dispersed, pattern can 
improve outcomes when 
restoring wetlands 






RESTORATION ECOLOGY 

Planting to enhance positive 
interactions 

Rising sea levels and human development threaten
coastal ecosystems worldwide. Efforts to protect and 
restore these environments include a range of physi- 
cal and biological measures, including reestablishing 
vegetation to stabilize the habitat and restore ecosys- 
tem function. Silliman et al. report the results of experiments 
in coastal wetlands in Florida and the Netherlands, show- 
ing that the simple pattern of planting (clumped versus 
dispersed) marsh grasses influences the success of restora- 
tion. Compared to a dispersed pattern of planting, planting 
marsh grasses in a clumped pattern enhanced their survival, 
biomass, and density. Clumped plants interacted positively; 
for instance, they reduced their anoxia stress, and neighbor- 
ing roots shared oxygen more readily, effectively producing a 
more oxygen-rich microenvironment than isolated individual 
plants can achieve. — AMS 

Proc. Natl. Acad. Sci. U.S.A. 112, 14297 (2015).

GEOPHYSICS

Setting the table for an 
old plate 

New pictures of Earth’s interior 
reveal an ancient tectonic plate 
sinking toward the bottom of the 
mantle. Simmons et al. find the
structure in seismic tomography 
images under the data-limited 
region of the Indian Ocean. 
Complete subduction of the plate 
occurred more than 100 million 
years ago. This observation sug- 
gests that old plates can hang 
around in the mantle longer than 
previously thought. It also pro- 
vides a new piece of information 
for reconstructing plate motions 
and landmass locations in the 
distant past. — BG 

Geophys. Res. Lett. 10.1002/2015GL066237 (2015).



BEHAVIOR 

Empathy is for the birds 

Many species empathize with 
one another by state matching, a
process whereby an individual 
shifts its physiological state to 
match another’s. Increasing 
evidence suggests that empa- 
thy occurs across mammals, 
but whether other vertebrates 
empathize is unclear. Birds 
are prime candidates for state 
matching, given their extensive 
parental care, pair bonding, and 
sociality. Perez et al. now show 
that in highly monogamous 
zebra finches, females match 
their state of stress (as mea- 
sured by glucocorticoids) to that 




Monogamous zebra finches empathize 
through song 







of their male mate, as perceived 
only through changes in his 
call. No match appeared when 
females listened to the calls of 
unknown males, which suggests 
that the close bond facilitates 
empathy. — SNV 

Horm. Behav. 10.1016/j.yhbeh.2015.09.002 (2015).



AGING 

A circadian secret to 
a long life 

Dietary restriction (reduced 
nutrient intake without mal- 
nutrition) increases life span 
in multiple species, including 
fruit flies, but how this occurs is 
largely unknown. Katewa et al.
now find that in flies, changes 
in the expression of genes that 
regulate the circadian clock 
underlie the life-extending 
effect of dietary restriction. 
Dietary restriction increased 
the amplitude of daily cycles 
of clock gene expression in 
tissues such as the fly head 
and body. In flies with mutated 
clock genes, dietary restriction 
failed to increase longevity. 
Moreover, genetically increas- 
ing the periodicity of circadian 
gene expression also extended 
the life span of flies. Further 
analysis indicated that circadian 
gene expression contributes 
to changes in metabolism that 
mediate the effects of dietary 
restriction on life span. — LBR 

Cell Metab. 10.1016/j.cmet.2015.10.014 
(2015) 



CATALYSIS 

Upgrading ethanol 
without adding hydrogen 

An issue with ethanol-usage 
mandates for gasoline is the 
“blend wall”: Many vehicles
can only use up to 10 to 15% 
ethanol. In order to use more 
ethanol as a fuel, it must be 
converted to hydrocarbons that 
blend into gasoline, but exist- 
ing processes have suffered 
from the need to add hydrogen 
or produce a high fraction of 
undesirable small hydrocar- 
bons. Narula et al. report the 
conversion of ethanol to larger 



hydrocarbons suitable for use 
in gasoline, as well as diesel 
and jet fuels, with a ZSM-5 
zeolite containing indium and 
vanadium. A hydrocarbon pool 
mechanism appears to operate 
without dehydration steps. 

- PDS 

Sci. Rep. 10.1038/srep16039 (2015).



NEPHROLOGY 

A circulating harbinger of 
kidney disease 

Chronic kidney disease affects 
600 million people worldwide. 
Initially asymptomatic, the dis- 
ease often follows a progressive 
course that can lead to heart 
disease and kidney failure. An 
easily measured biomarker that 
identifies people at high risk 
of developing kidney disease 
would allow doctors to inter- 
vene earlier so that patients’ 
disease progresses at a slower 
rate. Toward that end, Hayek 
et al. examined the predictive
capacity of suPAR, a protein 
previously linked to a certain 
type of kidney disease. They 
serially monitored plasma levels 
of suPAR in a large group of 
patients, many with normal kid- 
ney function at the study’s start, 
and found that elevated levels 
of the protein correlated with a 



decline in kidney function and 
with new-onset chronic kidney 
disease. — PAK 

N. Engl. J. Med. 373, 1916 (2015).



PHYSICS 

Studying magnetism in 
an atomic chain 

When we think of magnets, 
solid materials generally come 
to mind, but researchers 
can also study magnetism by 
placing individual atoms in 
an egg-crate-like potential 
created by lasers: an optical 
lattice. This approach, however, 
requires extraordinarily low 
temperatures. To circumvent 
that requirement, Murmann et 
al. studied a string of three or 
four fermionic ⁶Li atoms in an
elongated optical trap. In this 
one-dimensional system, no 
lattice was needed because 
the atoms, tuned to interact 
strongly, spontaneously formed 
an ordered chain. The research- 
ers first placed three atoms in 
a particular spin configuration 
and then tilted the trap to let the 
outermost atom tunnel out. The 
spin state of the tunneling atom 
revealed that the atoms were 
initially in an antiferromagnetic 
state. — JS 

Phys. Rev. Lett. 115, 215301 (2015).

MICROBIOME

The ocean microbial 
system 

The vast translucent oceans 
are teeming with microscopic 
life that drives significant life 
processes and elemental cycling 
on Earth. Yet how climate 
change will affect the function- 
ing of this microbiome is not 
well understood. Moran reviews 
progress and the transformative 
discoveries made recently in 
marine microbiology that have 
led environmental, plant, animal, 
and even human microbiome 
research. — CA 

Science, this issue p. 1330 

PROTEIN STRUCTURE 

The principles of protein 
assembly 

A knowledge of protein 
structure greatly enhances 
our understanding of protein 
function. In many cases, 
function depends on oligomer- 
ization. Ahnert et al. used mass 
spectrometry data together 
with a large-scale analysis of 
structures of protein complexes 
to examine the fundamental 
steps of protein assembly. 
Systematically combining 
assembly steps revealed a large 
set of quaternary topologies 
that were organized into a 
periodic table. Based on this 
table, the authors accurately 
predicted the expected frequen- 
cies of quaternary structure 
topologies. — VV 

Science, this issue p. 1331 

PHYSICAL CHEMISTRY 

Shaking out details of 
transition states 

Chemists liken reaction energet- 
ics to a landscape with hills 
and valleys. In this context, the
transition state represents the
highest barrier that reagents 
must pass over en route to 
forming products. Baraban et 
al. introduce a framework for 
extracting details about the 
transition state of rearrange- 
ment reactions directly from 
vibrational spectral data. They 
identified a characteristic pat- 
tern in the spacing between 
vibrational energy levels near the 
transition state, which revealed 
its energy as well as the specific 
motions involved in surmounting 
the barrier. — JSY 

Science, this issue p. 1338 



SIGNAL PROCESSING 

Detecting a transient 
needle in a haystack 

Discriminating signals within a 
noisy environment is an issue 
crucial to many disciplines, 
from observational astronomy 
to secure communication and 
imaging. If the signal is periodic, 
then averaging over many mea- 
surements can help enhance the 
signal-to-noise ratio. However, 
for signals that present as a 
single transient event, the detec- 
tion capability has been limited. 
Ataie et al. developed a detector
that can lift that limitation by 
combining signal cloning with 
frequency combs and signal- 
processing techniques (see the 
Perspective by Vasilyev). Their 
detector could detect signals 
buried within noise that would 
otherwise be undetectable.
— ISO

Science, this issue p. 1343;
see also p. 1314



SURFACE SCIENCE 

Sticking hydrogen atoms 
to surfaces 

The simplest case of adsorption 
at a surface—that of a hydrogen
atom—is actually quite complicated.
This is because it is not
clear how this light atom can 
transfer enough momentum 
to the heavy surface that it can 
slow down and stick. Bünermann
et al. prepared highly energetically
controlled hydrogen atoms
(see the Perspective by Brune). 
On a gold surface, inelastic 
collisions occurred during 
adsorption, but not when an 
insulating layer of xenon atoms 
was used. — PDS 

Science, this issue p. 1346;
see also p. 1321



NEURONAL DYNAMICS 

In vivo imaging of 
neuronal voltage spikes 

Neuroscientists have long 
sought tools that allow optical 
imaging of individual neurons’ 
membrane voltage dynamics in 
awake behaving animals. Gong 
et al. genetically engineered a
protein voltage indicator that 
can report action potentials 
with <1-ms precision and
orders of magnitude lower 
spike detection error rates than 
were previously possible. They 
were thus able to record action 
potentials and membrane 
voltage dynamics in the brains 
of awake mice and fruit flies.
— PRS

Science, this issue p. 1361 



RESEARCH INVESTMENT 

Tracking the knowledge 
economy 

Although the U.S. investment
in scientific research can be 
documented readily, its output 
is harder to track. Zolas et al. 
combined data obtained from 
eight universities on their doc- 
torate recipients with data from 
business registries and the U.S. 
Census Bureau. This allowed 
them to link Ph.D. recipients to
all their subsequent employers. 
Doctoral recipients tended to 
stay in academia or join large 
companies with high salaries. 
Roughly 20% stayed in the 
state in which they received 
their degree. In the year after 
receiving a Ph.D., mathemati- 
cians and computer scientists 
received the highest salaries, 
and biologists received the 
lowest. — BJ 

Science, this issue p. 1367 

SIGNAL TRANSDUCTION 

Altering timing perturbs 
cell signaling 

Biological regulatory sys- 
tems have been optimized by 
evolution to accommodate 
environmental variation. Yet 
these systems may also have 
fragile aspects that can be 
exposed by variation in the tim- 
ing of signaling events. Mitchell 
et al. studied the properties
of the yeast signaling system 
that allows cells to adapt to 
changing osmotic conditions. 
The same properties also 
made the system sensitive 
to hyperactivation and the 
consequent inhibition of cell 
growth if exposed to oscilla- 
tions in osmotic conditions 
with a particular frequency. 

The identification of similar 
fragility in other regulatory 
pathways might prove useful in 
the development of therapeutic 
strategies against diseases in 
which signaling is perturbed, 
such as cancer and diabetes.
— LBR

Science, this issue p. 1379 

TRANSCRIPTION 

“Please release me, 
let me go.” 

RNA polymerase II (Pol II) is 
the principal protein complex
required for gene transcription 
in metazoan cells. Many genes 
have a “paused” Pol II near 
their promoters, waiting to 
be released so they can start 
messenger RNA synthesis. Yu 
et al. show that Pol II-associated
factor 1 (PAF1) plays a
central role in regulating the 
activation of these paused 
Pol II complexes. The positive 
transcription elongation factor 
b helps recruit PAF1 to the 
paused Pol II. This facilitates 
the phosphorylation of Pol II 
on its C-terminal domain, free- 
ing it to start transcription in 
earnest. — GR 

Science, this issue p. 1383 



VITAMIN C RESEARCH 

Getting all stressed out 
by vitamin C 

Few experimental cancer ther- 
apies have incited as much 
debate as vitamin C. Yet the 
mechanistic effect of vitamin 
C on cancer cells is still poorly 
understood. Yun et al. studied 
human colorectal cancer cells 
with KRAS or BRAF mutations 
and found that they “handle” 
vitamin C in a different way 
than other cells, ultimately 
to their detriment (see the 
Perspective by Reczek and 
Chandel). Because a certain 
receptor is up-regulated in 
the mutant cells, they take up 
the oxidized form of vitamin 
C (dehydroascorbate). This 
leads to oxidative stress, inac- 
tivation of a glycolytic enzyme 
required by the mutant cells 
for growth, and finally cell 
death. Whether the selective 
toxicity of vitamin C to these 
mutant cells can be exploited 
therapeutically remains 
unclear. — PAK 

Science, this issue p. 1391;
see also p. 1317
REVIEW SUMMARY



MICROBIOME 

The global ocean microbiome 



Mary Ann Moran 

BACKGROUND: Oceanographers began study- 
ing the ocean microbiome in earnest over four 
decades ago, when it was recognized that mi- 
crobes are responsible for nearly all of the 
energy flux in this largest and most dilute 
biological system on Earth. Much has been 
learned about the microbes that play key roles 
in every marine element cycle, but much is 
still unknown about the factors regulating 
their activity. Although the number of marine 
microbes per liter of seawater reaches into 
the billions, their small size means that, stat- 
istically, each microbe is separated by 100 to 
200 body lengths from its closest neighbors. 
Yet recognition of microscale structuring of 
both microbial communities and marine or- 
ganic matter suggests that the ocean micro- 
biome does not operate as stand-alone cells in 
a watery soup. 

ADVANCES: Several decades of 16S ribosomal
RNA gene analysis have revealed distinct and
recurring bacterial communities in the ocean. 
More recent characterizations of marine archaea,
protists, and viruses are filling out the
taxonomic inventories of the ocean microbiome
and showing that membership is predictable 
over seasons, ocean depth, and organic matter 
features. The retrieval of proteorhodopsin— 
a gene that allows cells to harvest energy from 
sunlight without complex photosynthetic 
machinery— from an uncultured ocean mi- 
crobe marked the first exciting discovery from 
the use of “meta-omics” methodologies in the 
ocean. Now these techniques are the central 
tools for converting inventories of organisms 
and functions into explicit linkages between 
the two. Substantial progress has been made 
toward unraveling how and where microbes 
participate in ocean biogeochemical processes, 
as well as toward recognizing new categories 
of nonpredatory microbial alliances that op- 
erate based on the exchange of nitrogen, vita- 
mins, hormones, and antibiotics. 

Several characteristics of the ocean micro- 
biome distinguish it from microbiomes on or 
in animals, plants, and soils. First, the primary 
producers that fuel the ocean are exclusively 
microbial and thus are a part of the microbiome.
This is the case for photosynthesis in
the surface ocean and for chemosynthesis carried
out in deeper waters. The ocean microbiome
is responsible for half of all primary
production occurring on Earth. Second, trophic 
categories are particularly difficult to assign in 
the ocean microbiome, with no clear division 
of organisms into canonical autotrophic and 
heterotrophic roles. Proteorhodopsin, anoxy- 
genic phototrophy, and chemolithotrophic en- 
ergy acquisition from inorganic compounds 
create trophic mayhem among members of 
the ocean microbiome. Having multiple strat- 
egies for meeting metabolic requirements may 
be an advantage in this chemically dilute and 
physically dynamic environment. Last, hetero- 
geneity in the structure of seawater organic 
matter has become a foundational concept 
for the ocean microbiome because it aligns 
with differences in mi- 
crobial attributes. Bacteria 
and archaea that live sin- 
gly in seawater differ from 
those that intermingle on 
the various marine poly- 
mer networks and organic 
surfaces in terms of phylogenetic affinity, me- 
tabolism, and capabilities for motility, chemo- 
taxis, and defense. Single bacteria and archaea 
are numerically dominant in terms of cells, 
genes, and transcripts, but those clustered near 
surfaces have higher per-cell rates of metabo- 
lism and growth. The importance of material 
exchanges and signaling networks between 
neighboring cells in the ocean, as well as the con- 
sequences spatial arrangements impose on bio- 
geochemical processing, are not yet understood. 



Sampling the ocean microbiome. (Left) New instruments such as the Environmental Sample
Processor (ESP) (Monterey Bay Aquarium Research Institute) autonomously sample ocean 
microbes and environmental conditions while deployed at sea. (Right) Microbial cells preserved in 
metal pucks inside the ESP are removed for gene expression analysis. 



OUTLOOK: Earth’s changing climate is pre- 
dicted to decrease carbon fixation by microbial 
primary producers, favor smaller picophyto- 
plankton over larger nano- and microphyto- 
plankton, and impose stress on photosynthetic 
microbes that form calcium carbonate shells. 
The structure of phytoplankton communities, 
in turn, has implications for the abundance 
and composition of organic substrates for het- 
erotrophic microbes, as well as for dictating 
which trophic strategies will be under selec- 
tion in the future ocean. Taking stock of the 
ocean microbiome in terms of cells, genes, 
transcripts, and proteins now has a long tra- 
dition in oceanography. Linking these stocks 
with the regulation of critical ecosystem func- 
tions is the next challenge. One key step in this 
process is the identification of the molecules 
that pass between microbes as substrates, nu- 
trients, signaling molecules, and defensive com- 
pounds; these are the “currencies” of ocean 
microbiome function. ■ 
MICROBIOME

The global ocean microbiome 

Mary Ann Moran 

The microbiome of the largest environment on Earth has been gradually revealing its 
secrets over four decades of study. Despite the dispersed nature of substrates and the 
transience of surfaces, marine microbes drive essential transformations in all global 
elemental cycles. Much has been learned about the microbes that carry out key 
biogeochemical processes, but there are still plenty of ambiguities about the factors 
important in regulating activity, including the role of microbial interactions. Identifying the 
molecular “currencies” exchanged within the microbial community will provide key 
information on microbiome function and its vulnerability to environmental change. 



The ocean microbiome is a highly dilute microbial
system that covers the majority of
Earth’s surface and extends an average of 
3600 m down to the seafloor. As one of the 
first microbiomes to be studied, the diversity
and distribution of its members are now
becoming familiar. Key questions remain, how- 
ever, regarding how functional capabilities are 
distributed and what biotic and environmental 
factors control their rates. 

A consistent link is emerging between ocean 
temperature and both the composition (1, 2) and 
productivity (3) of microbes inhabiting surface 
seawater. Stratification of the ocean into more 
discrete layers and consequent reduction in the 
mixing of nutrient-rich deeper waters to the sur- 
face may be one key mechanism behind this link 
(3). Decreases in ocean pH, a direct chemical 
outcome of increased absorption of CO₂ from the
atmosphere, affect the energetic costs to microbes 
of obtaining inorganic carbon for photosynthesis 
and the availability of CaCO₃ for forming calcareous
shells (4). To expand our evolving knowledge
of the ocean microbiome, recent work has been 
directed toward explaining how the resident mi- 
crobes will be affected by the predicted impacts 
of climate change on the ocean ecosystem. 

In 1974, the idea that microorganisms are the 
major consumers of energy in the sea was formally
articulated (5). Marine microbes <60 µm in
diameter had gradually been recognized as responsible
for nearly all of the primary production
and respiration occurring in the ocean. This “new 
paradigm” proposed four decades ago marked 
the first conceptualization of the ocean micro- 
biome, a community of microorganisms inhabit- 
ing 71% of Earth’s surface (6) and taking part in 
every one of its major elemental cycles. 

Despite population numbers reaching into the 
billions per liter of seawater, marine bacteria in 
today’s ocean are separated from their closest 
neighbor by an average of 200 body lengths, 
due to their small size. The larger but less
abundant microbial eukaryotes are separated 
by an average of ~100 body lengths (7). Early
views of the ocean microbiome were therefore 
premised on the idea that interactions between 
members were indirect, except in predator-prey 
relationships. Microbes added and removed 
organic and inorganic materials from the pool 
of nonliving matter in seawater, and their in- 
teractions were mediated largely through the 
compounds in this pool. Yet the groundwork 
for recognizing the presence of structured micro- 
bial communities, even in dilute ocean waters, 
had already been laid (8). Bacteria were shown 
to exhibit chemotaxis to phytoplankton exudates
(9) and to gain energy benefits when clustered
around patches of concentrated organic matter
(10). These were early hints that the ocean microbiome
does not operate as stand-alone cells
in a watery soup. 
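
The separation figure quoted above can be checked with a back-of-envelope estimate. The sketch below assumes that cells are evenly dispersed, so that each one occupies a cube of volume 1/n, and it uses 10⁹ cells per liter and a 0.5-µm cell length as illustrative values; neither number is stated in the text, and real cells are patchily distributed.

```python
def spacing_in_body_lengths(cells_per_liter, body_length_um):
    """Mean spacing between evenly dispersed cells, in body lengths.

    Each cell is assigned a cube of volume (1 liter / cells_per_liter);
    the cube's edge length approximates the distance to the nearest neighbor.
    """
    um3_per_liter = 1e15  # 1 L = 1e-3 m^3 = 1e15 cubic micrometers
    volume_per_cell_um3 = um3_per_liter / cells_per_liter
    spacing_um = volume_per_cell_um3 ** (1.0 / 3.0)
    return spacing_um / body_length_um


# Illustrative (assumed) inputs: 1e9 bacteria per liter, 0.5-um cells.
bacteria = spacing_in_body_lengths(1e9, 0.5)
print(f"bacteria: about {bacteria:.0f} body lengths apart")
```

With these assumed inputs the mean spacing is about 100 µm, or roughly 200 body lengths, which is consistent with the figure given for marine bacteria.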

The organic matter continuum 

The microscale structure of organic materials in 
seawater that is relevant at the microbial scale, 
although daunting in its complexity, is now bet- 
ter understood. At the low end of the size range, 
seawater harbors a pool of dissolved organic mol- 
ecules released from plankton or leaked during 
predatory interactions. There are tens of thou- 
sands of different structures of these dissolved 
compounds (11), an overwhelming diversity 
whose origin and fate are only beginning to be 
understood (12, 13). Intermediate structures in- 
clude three-dimensional polymer networks, or 
“microgels” (14), that can serve as loci for microbial
clustering (15), followed by “macrogels”
(larger polymer networks, often dominated by 
polysaccharides). At the upper end of the size 
range are eukaryotic plankton cells, both alive 
and dying. Although eukaryotic microbes typi- 
cally are not heavily colonized by bacteria or 
archaea when healthy (16), the larger cells create 
zones of concentrated dissolved organic matter 
extending from their surfaces— these zones are 
referred to as “phycospheres” (9) (Fig. 1). Fecal 
pellets and other remnants of multicellular
organisms also contribute to the organic matter
matrix. 

The heterogeneity in the structure of seawater 
organic matter is a foundational concept for the 
ocean microbiome because it aligns with differ- 
ences in composition, metabolism, and genome 
content of the member microbes. Bacteria and 
archaea that live singly in the ocean passively 
encounter small molecules and efficiently scav- 
enge them at low concentrations (17). Those that 
intermingle with the various types of polymer 
networks and organic surfaces have better access 
to substrates but more neighbors to contend 
with compared to the free-living cells (Fig. 1). 
Along with differences in phylogenetic affiliation 
(18) and metabolic capabilities (19, 20), these 
bacteria and archaea, compared to their free- 
living counterparts, have genes biased toward 
capabilities such as signal transduction, defense, 
anaerobic metabolism, and carbohydrate pro- 
cessing (21, 22) and use motility and chemotaxis 
to find substrate-rich plumes and filaments as- 
sociated with the organic matrix (23, 24). In some 
ways it is surprising that these differences con- 
sistently emerge, despite the fact that bacteria 
and archaea are quite coarsely assigned their 
place in the organic matter continuum based on 
passage through a filter: Those passing through a 
pore of a few micrometers in diameter (typically 
2.0 µm) are considered free-living, whereas those
retained are considered matrix-associated. Eu- 
karyotic plankton are also divided across this 
size-based boundary. Small eukaryotes (picoeu- 
karyotes) are collected with the free-living bacte- 
ria and archaea by virtue of their small size (25). 
Larger eukaryotic microbes (nano- and micro- 
eukaryotes) are collected with matrix-associated 
bacteria and archaea. 

How do inventories and activities compare be- 
tween microbes associated with different organic 
matter features of the ocean microbiome? As a 
general rule of thumb, free-living bacteria and 
archaea account for more cells (~80%) and more
genes and transcripts per volume of seawater (26) 
than those that are matrix-associated (Fig. 2). Yet 
per-cell rates of metabolism and growth (27, 28) 
coincident with per-cell inventories of genes 
and transcripts (26, 29) are higher in the organic 
matrix-associated bacteria and archaea. Cyano- 
bacteria and picoeukaryotes, minute enough to 
pass through a 2.0-µm filter pore, are the smallest
photosynthetic organisms of the ocean. In
seawater, picoeukaryotes are typically an order 
of magnitude fewer in number than cyanobacteria, 
but they contribute more biomass per cell (30) 
and can produce nearly equivalent numbers of 
transcripts (Fig. 2). The larger eukaryotes are 
generally most abundant where nutrient levels 
are high, such as in coastal oceans or upwelling 
regions, and can sometimes rival heterotrophic 
bacteria in their contribution of microbial genes 
and transcripts to seawater (Fig. 2). 

The assemblage 

Early studies typically used the word “assem- 
blage” to refer to microbes in seawater; this term 
indicates that encounters might be too infrequent 
and random to be considered a structured ecological
community. Eventually, distinct and predictable
microbial communities were identified
in the ocean, enabled by new methodologies that 
did not depend on cultivation. In the first application
of 16S ribosomal RNA (rRNA) gene
amplicon surveying to a natural ecosystem, three
previously unidentified bacterial lineages were 
discovered among just 12 sequences amplified 
from Sargasso Sea surface waters (31). The SAR11 
lineage was uncovered in this pioneering survey 
and is now recognized as the most abundant bac- 
terial group in the world’s oceans (32, 33). Several 
decades of 16S rRNA gene analysis have made us
quite knowledgeable about the identity, relative 
abundance, and patterns of occurrence of marine 
bacteria and archaea (2, 34, 35). Most marine mi- 
crobial communities are composed of few domi- 
nant species and a long tail of rare ones (36), and 
species are divided into coexisting populations 
that imply fine-scale dissection of available re- 
sources (37, 38). Latitudinal patterns in species 
richness similar to those for macroorganisms 
have been observed (1), and community composition
is predictable over time (39), depth (40),
and in relation to the structure and composition
of seawater organic matter (18, 41-43). 

More recently, attention has turned to the taxonomic
characterization of protists [recognized
in 1974 as important in ocean microbial communities
(1)], viruses [joining the ocean microbiome
in 1989 (44, 45)], and archaea [joining in
1992 (46, 47)]. A recent comprehensive accounting
of ocean microbes by the Tara Oceans expedition
found 37,000 bacterial and archaeal
species [based on 16 million metagenomic 16S
rRNA genes (2)], 100,000 protist groups [based
on 580 million amplified 18S rRNA genes (48)],
and 5500 viral populations [based on 2 million
metagenomic sequences and considering only
double-stranded DNA bacterial and archaeal viruses
(49)]. If each sample in this survey is representative
of a 0.25-km³ homogeneous parcel of
seawater (35), we now have a snapshot in time of
a 26-millionth of the ocean microbiome.
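
The "26-millionth" figure is a ratio of sampled volume to total ocean volume, and the arithmetic can be sketched as follows. The total ocean volume used here (about 1.335 × 10⁹ km³) is a commonly cited estimate and is an assumption, not a value from this article; the number of samples is not stated in this section, so the sketch only works backward to the sample count that the stated fraction would imply.

```python
OCEAN_VOLUME_KM3 = 1.335e9   # assumed total ocean volume; not from the article
PARCEL_VOLUME_KM3 = 0.25     # seawater parcel represented by one sample (from the text)


def sampled_fraction(n_samples, parcel_km3=PARCEL_VOLUME_KM3,
                     ocean_km3=OCEAN_VOLUME_KM3):
    """Fraction of the ocean represented by n_samples parcels."""
    return n_samples * parcel_km3 / ocean_km3


def samples_for_fraction(fraction, parcel_km3=PARCEL_VOLUME_KM3,
                         ocean_km3=OCEAN_VOLUME_KM3):
    """Number of parcels needed to represent a given fraction of the ocean."""
    return fraction * ocean_km3 / parcel_km3


# Work backward from the stated fraction (one 26-millionth).
implied_samples = samples_for_fraction(1 / 26e6)
print(f"implied number of samples: about {implied_samples:.0f}")
```

Under the assumed ocean volume, a 26-millionth of the ocean corresponds to roughly 200 parcels of 0.25 km³ each; a different volume estimate would shift that count proportionally.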

Functions in the ocean 

A gene retrieved from an uncultured coastal ocean 
bacterium in 1999 challenged perceptions about 
energy acquisition by the ocean microbiome (50). 



This gene encodes a rhodopsin protein— belonging 
to a family previously known only from microbes 
living in extreme environments— that allows cells 
without photosynthetic machinery to harvest en- 
ergy through sunlight-driven proton pumping. 
The discovery overturned a fundamental distinc- 
tion between microbes obtaining energy from 
sunlight versus from organic matter and was a 
“tip-of-the-iceberg” moment that hinted at other 
notable microbial physiologies yet to be dis- 
covered. Before this study, the main strategy for 
linking microbes to their roles in the ocean was 
through “guilt by association,” with functional 
roles of organisms known only by their 16S rRNA
gene sequences inferred from the physiology of 
their cultured relatives (if they existed). But the 
discrepancies in evolutionary history between 
the highly conserved phylogenetic marker genes 
and the highly diverse accessory genes (more 
easily shaped by forces of selection, drift, and hor- 
izontal transfer) made this an unsatisfying ap- 
proach. Further, it confined progress in functional 
understanding to physiologies known from a nar- 
row suite of cultured microbes. 

The proteorhodopsin discovery emerged from 
the idea to clone random fragments of bacterial 
DNA from seawater (51) and then sequence the 
regions flanking 16S rRNA genes (50). Eventually,
capabilities for this proteorhodopsin-based
energy transduction were found in >40% of bac- 
terial cells in the surface ocean (52), representing 
all major heterotrophic bacterial lineages (53), as 
well as in marine archaea (54), microbial eukary- 
otes (55, 56), and even viruses (57). The fact that 
such a fundamental energy-acquisition mecha- 
nism had gone unnoticed for decades was equal- 
ly sobering and exhilarating. 

Metagenomics and other “meta-omics” meth- 
odologies thus became central tools for turning 
disjointed inventories of organisms and func- 
tions into explicit linkages between them. For 
example, our understanding of nitrogen controls 
on marine primary production was advanced 
when a widespread unicellular N₂-fixing cyanobacterium,
known only by its nitrogenase gene
sequence, was successfully hunted down (58). 
Metagenomic sequencing of an enrichment of 
these elusive UCYN-A cells revealed a surprising 
cyanobacterial genome lacking genes for photo- 
system II and carbon fixation and therefore 
represented a novel category of ocean microbial
function that potentially decoupled carbon and
nitrogen fixation (58). Similarly, characterization
of the marine bacteria responsible for metabolizing
a sulfur-containing phytoplankton osmolyte
(dimethylsulfoniopropionate) began with gene discovery
efforts in cultured marine strains (59-61),
followed by metagenomic data mining (52, 59)
and then autonomous ocean sensing (62). Regulation
of these bacterial genes affects the primary
source of biogenic sulfur emissions to the
atmosphere (63) and influences sulfur cycling on
a global scale (64, 65).

Fig. 1. Microscale structure of the ocean microbiome. The ocean microbiome is dispersed across a dilute matrix of organic matter, populated by bacterial, archaeal, and eukaryotic cells using complex trophic strategies and fueled by microbially fixed carbon. Molecules, gels, and a living diatom represent the organic matter matrix of seawater. Dashed lines indicate predatory or nonpredatory interactions between microbiome members.

Fig. 2. Organisms and “omics.” (A) Typical inventory of microbial cells and (B) microbial genes and transcripts per liter of ocean water (26, 29).

The Global Ocean Sampling data set released 
in 2004 consists of 1 gigabyte of metagenomic sequence
data from the North Atlantic Ocean (66);
the Tara Oceans data set released in 2015 consists
of 7200 gigabytes of metagenomic data from mul- 
tiple locations and depths across the globe (2). 
Our knowledge of marine microbial function has 
been rapidly propelled by the availability of these 
and other marine meta-omic data sets (67, 68). 
They provide windows into the multitude of or- 
ganisms, genes, and transcripts of the ocean mi- 
crobiome that run the global elemental cycles, 
buffer atmospheric CO₂ concentrations, and are on
the front line of Earth’s changing climate (3, 69). 
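
The growth between the two data sets mentioned above can be expressed as a doubling time. The sketch below is only an illustration: it assumes steady exponential growth between the 2004 and 2015 releases, which two data points cannot confirm, and the function name is hypothetical.

```python
import math


def doubling_time_years(size_start, size_end, years):
    """Doubling time implied by growth from size_start to size_end,
    assuming steady exponential growth over the interval."""
    doublings = math.log2(size_end / size_start)
    return years / doublings


gos_gb, tara_gb = 1, 7200            # data set sizes quoted in the text
elapsed_years = 2015 - 2004          # release years quoted in the text

doublings = math.log2(tara_gb / gos_gb)
doubling_time = doubling_time_years(gos_gb, tara_gb, elapsed_years)
print(f"{doublings:.1f} doublings, one every {doubling_time * 12:.0f} months")
```

Under that assumption, a 7200-fold increase over 11 years corresponds to nearly 13 doublings, or one doubling in somewhat less than a year.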

Neighborhood associations 

The assemblage perspective of early researchers 
has given way to a growing understanding of the
diverse mechanisms by which ocean microbes
interact. Even though microbial cells occupy only
~0.0001% of seawater volume (70), predatory activities
are surprisingly efficient. Between 20 and
50% of bacterial cells are killed each day in the 
ocean by viral infection (71), and a similar proportion
is consumed by protist grazers (72). Microbial
predators and prey are engaged in sophisticated 
evolutionary arms races; for example, cyano- 
bacterial viruses carry copies of photosynthesis 
genes, originally captured from their hosts, that 
ensure a sufficient number of light-harvesting pro- 
teins during later stages of infection (73). Whole 
new categories of nonpredatory microbial alli- 
ances are also being recognized. These include 
bacteria that release a hormone promoting phytoplankton
cell division (74); phytoplankton that
synthesize novel organic molecules that only cer- 
tain bacterial species can use as substrates (75); 
nitrogen-fixing cyanobacteria that are endo- 
symbiotic (76) or episymbiotic (77) with eukaryotic 
hosts; ciliates that borrow functional chloroplasts 
from their otherwise-digested prey (78); microbes 
that rely on neighboring cells for essential vita- 
mins and defensive enzymes because they have 
lost their own capability for synthesis (79, 80); 
and bacteria that attack neighboring microbes 
through the production of algicides (81), antibac- 
terial compounds (82), or both (83). At the level 
of community assembly, there is evidence of pat- 
terns in microbial composition and function over 
time frames of tidal cycles, days, and seasons 
(40, 84, 85). The reliance on filters
to distinguish microbial roles in the ocean microbiome
is now being complemented by methods
that track individual cells moving through
seawater (86), quantify metabolite exchanges be- 
tween two microbes (87), and sequence parasite 
or symbiont genomes inside a single host mi- 
crobial cell (88). Our appreciation of the richness 
of microbial interactions in the ocean and their 
relevance to microbial function is in the very 
early stages. 

An unusual microbiome 

Several combined features of the ocean micro- 
biome distinguish it from those on or in animals, 
plants, and soils. First, the organisms responsible 
for fueling the system through primary produc- 
tion are exclusively microbial and, thus, part of 
the microbiome rather than external to it. This is 
the case for photosynthesis orchestrated by bac- 
teria and eukaryotes in the surface ocean and 
also for chemosynthesis carried out by archaea 
and bacteria (89). The ocean microbiome is re- 
sponsible for half of all primary production oc- 
curring on Earth (90). 

Trophic categories are particularly difficult to 
assign in the ocean microbiome (Fig. 3). The di- 
vision of organisms into strict autotrophic and 
heterotrophic roles began to unravel with the 
realization that proteorhodopsin allowed other- 
wise heterotrophic microbes to harvest light en- 
ergy (91). It continued with the discovery of 
anoxygenic phototrophic bacteria in the ocean 
that acquire energy from sunlight, using bacte- 
riochlorophyll a and associated photosynthetic 
reaction centers. However, unlike photosynthetic 
microbes, these bacteria are unable to fix CO₂
(92-94). Other ocean microbes are able to obtain 
energy from the oxidation of both inorganic 
compounds, such as carbon monoxide and sul- 
fide, and organic compounds (95). One hypoth- 
esis to explain the choreographed trophic mayhem 
of the ocean microbiome is that variability in 
organic matter and light, as microbes are mixed 
into and out of surface waters and sink to deep 
ocean layers, creates an advantage for organisms 
with numerous metabolic mechanisms. Another 
is that very low concentrations and supply rates 
of the individual organic resources in oligotrophic
seawater select for microbes that can take
advantage of many different energy and carbon
sources. 

Finally, because the majority of microbes in 
the ocean microbiome are free-living, only those 
cells populating the organic matter matrix ex- 
perience physical interactions at the scale typical 
of most other microbiomes (Fig. 1). Questions 
regarding how close microbes need to be to es- 
tablish networks of signaling and material ex- 
change (96-98) and what consequences spatial 
arrangements impose on biogeochemical pro- 
cessing (24, 99, 100) are still awaiting answers. 

Earth’s changing climate will affect character- 
istics of the ocean microbiome. Strengthening of 
stratification under a warming climate is widely 
predicted to decrease carbon fixation rates of the 
ocean’s microbial primary producers (67) and to 
favor picophytoplankton over larger nano- and 
microphytoplankton because the former are bet- 
ter competitors for nutrients (101, 102). Members 
of the phytoplankton community that form calcium
carbonate shells are anticipated to be negatively
affected by decreasing seawater pH (103).
In turn, phytoplankton composition has impli- 
cations for the abundance and composition of 
the polymer networks and organic surfaces that 
are the substrates for heterotrophic microbes 
(104). Picophytoplankton may be of insufficient 
size to create phycospheres detectable by chemo- 
sensory mechanisms (105), and their dominance 
would favor free-living over matrix-associated 
bacteria and archaea (104). Certainly, the intri- 
cate trophic schemes of marine microorganisms 
will be subject to redesign in the future ocean, 
with changes in surface seawater temperature 
(95), CO2 concentrations (96), O2 concentrations
(95), nutrient regimes (68), and light availability 
(97) shifting the physiological and ecological pay- 
offs of a microbe’s life history strategy. 

Conclusions 

The ocean microbiome fits the definition of an 
ecological community of microorganisms that 
share an environment (106). As is true for all mi- 
crobiomes, it is considerably easier to inventory 
microbial parts (cells, genes, transcripts, proteins, 
regulatory molecules) than to link the parts with 
their functions in the community or to understand 
the give-and-take relationships with a shifting environment.
Identifying the molecules (whether
as substrates, nutrients, signaling molecules, or
defensive compounds) that pass between microbes
is an emerging area of microbiome research
that will focus attention on the “currencies” of 
microbial activity. Ultimately, the flux of materi- 
als, both among microbes and between microbes 
and their surroundings, constitutes microbiome 
function and determines its response to external 
changes. 

Principles of assembly reveal a
periodic table of protein complexes

INTRODUCTION: The assembly of proteins
into complexes is crucial for most biological 
processes. The three-dimensional structures 
of many thousands of homomeric and het- 
eromeric protein complexes have now been 
determined, and this has had a broad im- 
pact on our understanding of biological 
function and evolution. Despite this, the 
organizing principles that underlie the great 
diversity of protein quaternary structures 
observed in nature remain poorly under- 
stood, particularly in comparison with pro- 
tein folds, which have been extensively classified 
in terms of their architecture and evolution- 
ary relationships. 

RATIONALE: In this work, we sought a com- 
prehensive understanding of the general 
principles underlying quaternary structure 
organization. Our approach was to consider 
protein complexes in terms of their assem- 
bly. Many protein complexes assemble spon- 
taneously via ordered pathways in vitro, and 
these pathways have a strong tendency to be 
evolutionarily conserved. Furthermore, there 
are strong similarities between protein complex
assembly and evolutionary pathways, with
assembly pathways often being reflective of 
evolutionary histories, and vice versa. This 
suggests that it may be useful to consider 
the types of protein complexes that have 
evolved from the perspective of what as- 
sembly pathways are possible. 

RESULTS: We first examined the fundamen- 
tal steps by which protein complexes can as- 
semble, using electrospray mass spectrometry 
experiments, literature-curated assembly data, 
and a large-scale analysis of protein complex 
structures. We found that most assembly steps 
can be classified into three basic types: di- 
merization, cyclization, and heteromeric sub- 
unit addition. By systematically combining 
different assembly steps in different ways, we 
were able to enumerate a large set of possible 
quaternary structure topologies, or patterns 
of key interfaces between the proteins within 
a complex. The vast majority of real protein 
complex structures lie within these topologies. 
This enables a natural organization of protein 
complexes into a “periodic table,” because 
each heteromer can be related to a simpler
symmetric homomer topology. Exceptions
are mostly the result of quaternary structure 
assignment errors, or cases where sequence- 
identical subunits can have different in- 
teractions and thus introduce asymmetry. 
Many of these asymmetric complexes fit
the paradigm of a periodic table when their
assembly role is considered. Finally, we implemented
a model based on the periodic table, which predicts
the expected frequencies of each quaternary structure topology,
including those not yet observed. Our model
correctly predicts quaternary structure topologies
of recent crystal and electron microscopy
structures that are not included in our
original data set.

Read the full article at http://dx.doi.org/10.1126/science.aaa2245

CONCLUSION: This work explains much of 
the observed distribution of known protein 
complexes in quaternary structure space and 
provides a framework for understanding 
their evolution. In addition, it can contrib- 
ute considerably to the prediction and mod- 
eling of quaternary structures by specifying 
which topologies are most likely to be 
adopted by a complex with a given stoichi- 
ometry, potentially providing constraints for 
multi-subunit docking and hybrid methods. 
Lastly, it could help in the bioengineering of 
protein complexes by identifying which topol- 
ogies are most likely to be stable, and thus 
which types of essential interfaces need to be 
engineered. ■ 

Protein assembly steps lead to a periodic table of protein complexes and can predict likely quaternary structure topologies. Three main
assembly steps are possible: cyclization, dimerization, and subunit addition. By combining these in different ways, a large set of possible quaternary structure 
topologies can be generated. These can be arranged on a periodic table that describes most known complexes and that can predict previously 
unobserved topologies. 

Principles of assembly reveal a periodic table of protein complexes

Structural insights into protein complexes have had a broad impact on our understanding of
biological function and evolution. In this work, we sought a comprehensive understanding of the 
general principles underlying quaternary structure organization in protein complexes. We first 
examined the fundamental steps by which protein complexes can assemble, using experimental 
and structure-based characterization of assembly pathways. Most assembly transitions can be 
classified into three basic types, which can then be used to exhaustively enumerate a large set 
of possible quaternary structure topologies. These topologies, which include the vast majority 
of observed protein complex structures, enable a natural organization of protein complexes into a 
periodic table. On the basis of this table, we can accurately predict the expected frequencies 
of quaternary structure topologies, including those not yet observed. These results have 
important implications for quaternary structure prediction, modeling, and engineering. 



Evolution has given rise to an enormous variety
of protein complexes (1-3). The organizing
principles that underlie this diversity
remain poorly understood, particularly in 
comparison with protein folds, which have 
been classified extensively in terms of their ar- 
chitecture (4-6) and evolution (7, 8). However, net- 
work models have shown considerable promise in 
recent years for characterizing and comparing pro- 
tein complexes. For example, complexes are often 
represented as networks of associations between 
proteins, with little consideration for structure or 
stoichiometry. Alternatively, a graph representa- 
tion, which we introduced several years ago, can be 
used to capture the main features of quaternary 
structure topology (9). In this model, the nodes are 
the polypeptide chains, defined by their amino acid 
sequence and often referred to as subunits, and the 
edges are the interfaces between physically inter- 
acting chains, weighted according to size. 

Many protein complexes assemble spontaneously 
via ordered pathways in vitro, and we have shown 
that these assembly pathways have a strong ten- 
dency to be evolutionarily conserved (10, 11). Fur- 
thermore, there are strong similarities between 
protein complex assembly and evolutionary pathways,
with assembly pathways often being reflective
of evolutionary histories, and vice versa (12). Thus,
quaternary structure evolution essentially can be 
thought of as an assembly process occurring on 
an evolutionary time scale. This suggests that it 
may be useful to consider the types of protein com- 
plexes that have evolved from the perspective of 
assembly pathways. 

In this work, we attempted to understand and 
explain the organization of protein complexes in 
quaternary structure space, using the principles 
of assembly. First, by characterizing the assembly 
pathways of a large number of protein complexes, 
we found that assembly can be explained gener- 
ally by three basic steps: dimerization, cyclization, 
and subunit addition. Combinations of these steps 
allow us to exhaustively enumerate possible qua- 
ternary structure topologies within a given re- 
gion of quaternary structure space. 

To achieve this, we considered each polypeptide 
chain as a distinct self-assembly building block 
and considered all the ways in which interfaces 
can be distributed across the chains that are pres- 
ent in the complex. The large variety of possible 
topologies generated by this approach were then 
compared to observed structures. We found that 
~92% of known protein complex structures are
compatible with this model. 

A major benefit of this assembly-centric view 
of protein complexes is that it enables a natural 
organization of complexes into a “periodic table,” 
ordered by the number of subunit repeats (r) and 
the number of subunit types that are unique with- 
in a given complex (s). Exceptions are primarily 
the result of quaternary structure assignment er- 
rors or cases where sequence-identical subunits 
can have different interactions and thus introduce 
asymmetry. Many of these asymmetric complexes 
fit the paradigm of a periodic table when their 
assembly role (rather than their subunit identity) 
is considered. 



Finally, by combining the periodic table with our 
enumeration, we introduced a model to predict the 
expected frequencies of different quaternary struc- 
ture topologies. Not only does this model effective- 
ly replicate the relative frequencies of known protein 
complex structures, it also predicts the new topologies
that are most likely to be observed in the future.

A survey of transitions in the assembly 
pathways of protein complexes 

To understand the principles that underlie qua- 
ternary structure organization, it is useful to be- 
gin by considering the different ways in which 
protein complexes can assemble. We therefore first 
sought to determine the assembly and disassembly 
[“(dis)assembly”] pathways for as many protein 
complexes as possible. Previously, we have used 
electrospray mass spectrometry to characterize 
the (dis)assembly of eight homomers (10) and eight 
heteromers (11, 13). Whereas the homomers fol- 
lowed simple pathways, more diversity was ob- 
served for the heteromeric complexes. For this 
reason, in this study, we experimentally charac- 
terized the (dis)assembly pathways of nine addi- 
tional heteromers with widely varying quaternary 
structures (Fig. 1). In all of these cases, well-defined 
intermediate subcomplexes could be identified 
under at least one set of experimental conditions. 
All eight homomers and 15 of the 17 heteromers 
characterized by electrospray mass spectrometry 
to date have stoichiometries under native con- 
ditions that are consistent with the published biol- 
ogical units in the Protein Data Bank (PDB). 

We also searched the literature for protein com- 
plexes of known structure for which experimental 
(dis)assembly data are available, as we have done 
previously (10, 11). Often, these are cases where at 
least two different oligomeric states have been 
observed under equilibrium conditions. In total, 
we identified 11 homomers and 13 heteromers for 
which some (dis)assembly information is availa- 
ble in the literature. 

We obtained further information on protein 
assembly by considering the large number of pro- 
tein complexes of known structure. We searched 
for pairs of protein complexes where the quater- 
nary structure of one complex could be described 
as a subset of the other. Such pairs include, for 
example, a homodimer and a homotetramer with 
highly similar or identical sequences, suggesting 
that the tetramer assembles via a dimeric inter- 
mediate. Also included are homomer-heteromer 
pairs, where the heteromer has acquired a subunit 
with respect to the homomers. In total, this ap- 
proach identified 154 homomers and 263 heteromers 
with putative structure-based assembly information. 

We recognize that the structure-based path- 
ways do not represent direct characterization of 
assembly. Instead, they indicate that two or more 
different quaternary structure states have been 
observed, and we assume that assembly transi- 
tions can occur between them. Even for biophysi- 
cally characterized assembly pathways, we do not 
always have evidence that they are physiologically 
relevant. However, the fact that the biophysical
and structure-based pathways have a strong tendency
to reflect evolutionary history (10) and to
be evolutionarily conserved (11) does suggest that
they have a functional relevance.

Fig. 1. Mass spectrometry characterization of heteromer (dis)assembly
pathways. For each characterized complex, the known three-dimensional
structure is shown with a representative mass spectrum, accompanied by
graph representations of the full complex and subcomplexes. In all cases, the
full complex is represented by the rightmost graph. A full list of subcomplexes
is provided in table S1. The structures of 3DVA, 3O8O, and 4B7Y shown here
differ from those in the PDB: 3DVA is missing the γ subunit, because it was
not present in our sample, and the 4:4 model of 3O8O and the 4:2 model of
4B7Y were built from the unit cell to match the mass spectrometry data.
Colors in the graph representations indicate homomeric isologous (green),
homomeric heterologous (blue), and heteromeric heterologous (red) interfaces;
shapes indicate different subunit types.

Given this large set of assembly data, we next 
asked what quaternary structure transitions (as- 
sembly steps) tend to be observed. For homomeric 
complexes, we classified all possible transitions 
into three types (Fig. 2A, left). First, there is di- 
merization, where a doubling of the complex 
occurs and a twofold axis of rotational symmetry 
is formed (e.g., monomer-to-dimer or dimer-to- 
tetramer). Second, there is cyclization, which 
involves the assembly of a ring-like quaternary 
structure with higher-order rotational symmetry 
(e.g., monomer-to-trimer or monomer-to-tetramer). 
Third, there is fractional transition, an inherently 
asymmetric step in which the quaternary structure
changes by a non-integer ratio (e.g., dimer-to-trimer
or trimer-to-tetramer).
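
The three homomeric step types can be told apart from the subunit counts before and after a transition. The following is a minimal sketch, not the procedure used in this study: the function name is hypothetical, and it assumes that any integer ratio of 3 or more counts as cyclization, which matches the examples given above but is not stated as a general rule in the text.

```python
from fractions import Fraction


def classify_homomer_step(n_before: int, n_after: int) -> str:
    """Classify a homomeric assembly step from the subunit counts.

    Assumed rule (illustrative only):
      ratio == 2            -> dimerization
      integer ratio >= 3    -> cyclization
      non-integer ratio     -> fractional transition
    """
    if n_before < 1 or n_after <= n_before:
        raise ValueError("expected an assembly step with n_after > n_before >= 1")
    ratio = Fraction(n_after, n_before)
    if ratio == 2:
        return "dimerization"
    if ratio.denominator == 1:
        return "cyclization"
    return "fractional transition"


# The example transitions named in the text.
for before, after in [(1, 2), (2, 4), (1, 3), (1, 4), (2, 3), (3, 4)]:
    print(f"{before}-mer -> {after}-mer: {classify_homomer_step(before, after)}")
```

A count-based rule like this cannot see symmetry, so it is only a first approximation; a real classification would also need the interface types involved.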

For each homomer with assembly data, we iden- 
tified all the assembly steps that could account for 
the transitions between the free monomers, the 
observed subcomplexes, and the full complex (see 
Methods). The distributions of these three different 
assembly steps are shown in Fig. 2B. All three data 
sets show a similar trend, with dimerization being 
the most common step, cyclization being the next 
most common, and fractional transitions being rare. 
This is consistent with previous observations of the 
favorable assembly and evolutionary transitions between
homomers with different symmetries (10).



In heteromers, there are two further assembly 
steps that are possible, in addition to the three 
steps observed for homomers. These are illus- 
trated in Fig. 2A (right): subunit addition, in which 
a new subunit is acquired (e.g., monomer-to- 
heterodimer); and nonstoichiometric transition, 
in which the types of subunits within the hetero- 
mer remain the same, but their relative ratios 
change (e.g., assembly from 1:1 to 2:1 stoichiometry). 

The distributions of all five possible assembly 
steps for heteromers are shown in Fig. 2C. The 
same trend is observed among the three homo- 
meric steps, with dimerization being the most com- 
mon and few fractional transitions. However, across 
all five possible steps, the most common observed 
step for heteromers from all three data sets is het- 
eromeric subunit addition. 

Within the heteromers, there is a difference 
between the transitions observed in the mass 
spectrometry data and those recorded in the other 
data sets. Specifically, nonstoichiometric transitions 
are much more common in mass spectrometry 
data, as evident from the considerable number of 
subcomplex intermediates with uneven stoichi- 
ometry (different numbers of each subunit type) 
shown in Fig. 1. This can be attributed to two factors:
the sensitivity of the mass spectrometry
measurements to low-populated assembly intermediates,
and the way in which the mass spectrometry
experiments are performed, namely,
over a range of destabilizing solution conditions
designed to progressively disrupt the quaternary 
structure of the complex. We know that such non- 
stoichiometric transitions must occur in many 
cases where they are not observed. For example, 
consider the transition from an AA homodimer 
to a BAAB heterotetramer, where there is no in- 
teraction between the two B subunits. In this case, 
an AAB assembly intermediate should form, giv- 
en that it is highly improbable that two separate 
B subunits would bind simultaneously. How- 
ever, this asymmetric subcomplex is unlikely to 
be observed under non-destabilizing conditions 
and without highly sensitive mass spectrometry 
measurements. 

Enumeration of the topological space of 
protein complexes 

Next, we explored quaternary structure space by 
combining different assembly steps to determine 
which protein complex topologies are possible. 
Given that the protein complex assembly pathways 
described above are dominated by dimerization, 
cyclization, and subunit addition, we focused on 
these three steps. 

An important consideration is interface symmetry.
Dimerization results in a twofold axis of
rotational symmetry, and therefore the interface
formed by dimerization will be isologous (symmetric
or head-to-head) and will involve two
identical surfaces on subunits of the same type
(14). In contrast, cyclization results in higher-order
rotational symmetry and is associated with interfaces
that are heterologous (asymmetric or
head-to-tail) and that involve two different surfaces
on the same type of subunit. In addition,
there are heteromeric interfaces, formed between
two distinct polypeptide chains and hence by
definition also heterologous.

Fig. 2. Types of assembly steps observed in homomeric and heteromeric complexes. (A) The five possible types of assembly steps. (B and C) Distribution
of observed assembly steps for homomers and heteromers from mass spectrometry experiments, from assembly pathways identified in the literature, and
from complexes with varying quaternary structures in the PDB. Error bars represent 68% Clopper-Pearson confidence intervals.
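
The error bars in Fig. 2 are described as 68% Clopper-Pearson confidence intervals on the observed step frequencies. As a rough illustration of what such an interval is, the sketch below computes the exact binomial interval by bisection using only the standard library. The function names and the example counts are hypothetical and are not taken from the study's data.

```python
from math import comb


def binom_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def _solve_increasing(func, target: float) -> float:
    """Find p in [0, 1] with func(p) == target, for func increasing in p."""
    lo, hi = 0.0, 1.0
    for _ in range(100):
        mid = (lo + hi) / 2
        if func(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(k: int, n: int, level: float = 0.68):
    """Exact (Clopper-Pearson) interval for k successes in n trials."""
    alpha = 1 - level
    # Lower bound: P(X >= k | p) = alpha / 2
    lower = 0.0 if k == 0 else _solve_increasing(
        lambda p: 1 - binom_cdf(k - 1, n, p), alpha / 2)
    # Upper bound: P(X <= k | p) = alpha / 2
    upper = 1.0 if k == n else _solve_increasing(
        lambda p: 1 - binom_cdf(k, n, p), 1 - alpha / 2)
    return lower, upper


# Hypothetical counts: 12 dimerization steps among 20 observed steps.
low, high = clopper_pearson(12, 20)
print(f"observed fraction 0.60, 68% interval: {low:.3f} to {high:.3f}")
```

A 68% level corresponds roughly to one standard deviation for a normal distribution, which is presumably why it was chosen for the error bars.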

Proteins are inherently asymmetric at the level 
of individual polypeptide chains, so we can assume 
that the same interface surface cannot appear on 
the same protein twice, or on two structurally dif- 
ferent proteins. Together with this fundamental 
assumption, the three transitions (dimerization, 
cyclization, and subunit addition) all lead to sym- 
metric protein complexes with even subunit stoi- 
chiometry. This is because subunit addition can 
be viewed as the formation of a larger multipro- 
tein subunit, or protomer, which means that we 
can extend the homomeric definitions of dimer- 
ization and cyclization to homomers formed of 
these multiprotein subunits, leading to equal mul- 
tiples of each type of protein (Fig. 3). 

Every homomeric complex (of single-protein 
or multiprotein subunits) can have at most two 
isologous or heterologous interfaces, because each 
new homomeric interface imposes a new axis of 
rotational symmetry. In other words, symmetry 
constrains the number of homomeric interface 
types to a maximum of two. One or two inter- 
faces of two possible types give us five scenarios: 
(i) one isologous, (ii) one heterologous, (iii) two 
isologous, (iv) two heterologous, and (v) one isol- 
ogous and one heterologous. 
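
The five scenarios are simply the unordered selections of one or two interfaces from the two interface types. A short sketch that lists them (the names are illustrative only):

```python
from itertools import combinations_with_replacement

INTERFACE_TYPES = ("isologous", "heterologous")


def homomeric_interface_scenarios(max_interfaces: int = 2):
    """List unordered choices of 1..max_interfaces homomeric interface types."""
    scenarios = []
    for n in range(1, max_interfaces + 1):
        scenarios.extend(combinations_with_replacement(INTERFACE_TYPES, n))
    return scenarios


scenarios = homomeric_interface_scenarios()
for number, scenario in enumerate(scenarios, start=1):
    print(number, " + ".join(scenario))
print("total scenarios:", len(scenarios))
```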



To elucidate all possible heteromeric topologies 
that can arise under these constraints, we started 
by enumerating all tree-like topologies of s sub- 
units, in which each subunit type occurs exactly 
once (Fig. 3). We used trees rather than all pos- 
sible graphs, because we wanted to distinguish 
between essential and nonessential (“circumstan- 
tial”) interfaces in the complex (see next section 
and Methods for details). For each of the five 
scenarios described above, we then considered all 
topologically distinct ways (that is, distinct under 
symmetry operations on the tree) in which the in- 
terfaces can be distributed across the set of sub- 
units and pairs of subunits on the tree. The final 
step was to construct the topologies of the com- 
plexes from these distributions of interfaces across 
the tree. Some of these are isomorphic (taking into 
account interface types and subunit identities), 
which reduces the overall number of topologies. 
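
One way to read "topologically distinct ways" is as a count of the locations (single subunits and pairs of subunits) that are not related by a symmetry of the tree. The sketch below follows that reading, which is our assumption rather than the published Methods: it finds the automorphisms of a small tree by brute force and counts the orbits of subunits and subunit pairs. Under this reading, a two-subunit tree gives two locations and a three-subunit tree gives four, in agreement with the counts stated in the Fig. 3 legend.

```python
from itertools import combinations, permutations


def automorphisms(nodes, edges):
    """Yield node relabelings that map the edge set of the tree onto itself."""
    edge_set = {frozenset(e) for e in edges}
    for perm in permutations(nodes):
        mapping = dict(zip(nodes, perm))
        mapped = {frozenset((mapping[a], mapping[b])) for a, b in edges}
        if mapped == edge_set:
            yield mapping


def distinct_interface_locations(nodes, edges) -> int:
    """Count subunits and subunit pairs up to symmetry of the tree."""
    autos = list(automorphisms(nodes, edges))
    locations = [frozenset([n]) for n in nodes]
    locations += [frozenset(pair) for pair in combinations(nodes, 2)]
    orbits = set()
    for location in locations:
        orbit = frozenset(frozenset(m[x] for x in location) for m in autos)
        orbits.add(orbit)
    return len(orbits)


heterodimer = distinct_interface_locations(["A", "B"], [("A", "B")])
heterotrimer = distinct_interface_locations(
    ["A", "B", "C"], [("A", "B"), ("B", "C")])
print("heterodimer locations:", heterodimer)
print("heterotrimer locations:", heterotrimer)
```

Brute-force permutation is only practical for small numbers of subunit types; a larger enumeration would need a proper tree-isomorphism method.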

An important difference between our idealized 
model and real protein complexes is that real 
complexes can have more interfaces. However, 
we can directly relate real protein topologies to 
the above idealized forms if we consider some 
of the weaker intersubunit contacts to be circum- 
stantial. In other words, stronger, essential in- 
terfaces exist that bind the complex together by 
themselves. We can distinguish between the es- 
sential and circumstantial interfaces by succes- 
sively cutting away, in increasing order of size, as 
many interfaces as possible without giving rise to 
disconnected components of the complex (see 
Methods for details), thereby producing the simplest
possible graph representation of a quaternary
structure topology. This contrasts with the
previous approach of 3D Complex (9), in which
all intersubunit interfaces are considered. Thus, 
this representation effectively sits above the more 
detailed classification of 3D Complex: A single 
simplified topology used here can correspond 
to multiple 3D Complex topologies. 
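
The pruning of circumstantial interfaces can be sketched as a reverse-delete procedure on a weighted graph: try the interfaces from smallest to largest and drop each one whose removal leaves the complex connected. The code below is a simplified illustration in which every node is a single chain and every edge a single interface. The function names and interface sizes are hypothetical, and the procedure described in the Methods may differ (for example, in how symmetry-related interfaces are treated together).

```python
from collections import defaultdict, deque


def is_connected(nodes, edges) -> bool:
    """Breadth-first check that all nodes are reachable from the first one."""
    nodes = list(nodes)
    if not nodes:
        return True
    adjacency = defaultdict(set)
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)
    seen = {nodes[0]}
    queue = deque([nodes[0]])
    while queue:
        current = queue.popleft()
        for neighbor in adjacency[current]:
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(neighbor)
    return len(seen) == len(nodes)


def essential_interfaces(nodes, interfaces):
    """Remove interfaces in increasing order of size while staying connected.

    interfaces maps a chain pair (a, b) to an interface size.
    """
    kept = dict(interfaces)
    for edge in sorted(interfaces, key=interfaces.get):
        remaining = [e for e in kept if e != edge]
        if is_connected(nodes, remaining):
            del kept[edge]  # circumstantial: the complex holds without it
    return kept


# Hypothetical three-chain complex with interface sizes in square angstroms.
chains = ["A", "B", "C"]
sizes = {("A", "B"): 1200.0, ("B", "C"): 900.0, ("A", "C"): 300.0}
result = essential_interfaces(chains, sizes)
print(result)
```

In this toy case the small A-C contact is cut away and the two larger interfaces remain as the essential ones.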

Most real protein complexes are compatible 
with our model: 92.5% of homomers and 91.7% 
of heteromers have topologies identified in our 
exhaustive enumeration (Fig. 4). In these com- 
plexes, structurally identical proteins inhabit 
the same topological environment, meaning the 
same local environment in terms of the interfaces 
that they form with other subunits in a complex. 
We therefore define as bijective those complexes 
that have a one-to-one correspondence between 
their polypeptide sequence and their topological 
environment. 

In contrast, all of the real protein complexes 
not compatible with our enumeration are nonbi- 
jective, meaning that sequence-identical subunits 
exist in nonequivalent topological environments 
(Fig. 4). The difference between bijective and nonbijective
complexes is further illustrated in fig. S1.
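
The bijectivity test can be illustrated on a small graph. In the sketch below, the topological environment of a chain is approximated by the collection of (interface type, partner subunit type) pairs it takes part in. This first-neighbor definition is a simplifying assumption made for illustration, and all names are hypothetical. A complex is treated as bijective when every sequence-identical chain has the same environment.

```python
from collections import Counter, defaultdict


def topological_environments(chain_types, interfaces):
    """Map each chain to a hashable summary of its interfaces.

    chain_types: dict of chain id -> subunit type (sequence identity)
    interfaces:  list of (chain_a, chain_b, interface_type)
    """
    counts = {chain: Counter() for chain in chain_types}
    for a, b, kind in interfaces:
        counts[a][(kind, chain_types[b])] += 1
        counts[b][(kind, chain_types[a])] += 1
    return {chain: frozenset(c.items()) for chain, c in counts.items()}


def is_bijective(chain_types, interfaces) -> bool:
    """True if each subunit type occupies exactly one environment."""
    environments = topological_environments(chain_types, interfaces)
    per_type = defaultdict(set)
    for chain, subunit_type in chain_types.items():
        per_type[subunit_type].add(environments[chain])
    return all(len(envs) == 1 for envs in per_type.values())


# BAAB heterotetramer: both A chains see one A and one B.
baab_types = {"A1": "A", "A2": "A", "B1": "B", "B2": "B"}
baab_edges = [("A1", "A2", "isologous"),
              ("A1", "B1", "heteromeric"),
              ("A2", "B2", "heteromeric")]

# AAB intermediate: only one of the two A chains binds B.
aab_types = {"A1": "A", "A2": "A", "B1": "B"}
aab_edges = [("A1", "A2", "isologous"),
             ("A1", "B1", "heteromeric")]

print("BAAB bijective:", is_bijective(baab_types, baab_edges))
print("AAB bijective:", is_bijective(aab_types, aab_edges))
```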

Unlike our simple enumeration model that
requires only three types of assembly steps, nonbijective
complexes would require other asymmetric
fractional transition and nonstoichiometric
transition assembly steps. To explore this, we performed
an exhaustive enumeration of all possible
bijective and nonbijective topologies for complexes
with specific stoichiometries. We found that for
complexes with 2:2 stoichiometry, there are two
possible bijective topologies, compared with seven
possible nonbijective topologies (fig. S2). For
complexes with 3:3 stoichiometry, there are also
two possible bijective topologies, whereas the number
of possible nonbijective topologies rises sharply
to 250 (fig. S3). This illustrates a major benefit of
our approach: By limiting our model to only three
simple assembly steps, we are able to cover most
observed protein complexes with a much smaller
set of possible quaternary structure topologies.

Fig. 3. Three assembly transitions give rise to the topological space of protein
complexes. These transitions are cyclization via a homomeric heterologous interface
(blue), dimerization via a homomeric isologous interface (green), and subunit addition
via a heteromeric heterologous interface (red). We enumerated all possible
topologies arising from these steps by calculating all ways in which a cyclic or dihedral interface can
be distributed across a heteromer with 1:1 stoichiometry. For heterodimers, there are two such ways for
both the cyclization and the dimerization steps. For heterotrimers, there are four such ways for each
step. In the graph representation of the enumeration step, the possible locations of the distributed
interfaces are indicated by colored dots.
[Panel labels: subunit evolution; resulting assembly (showing graph representations only).]

RESEARCH | RESEARCH ARTICLE
SCIENCE sciencemag.org 11 DECEMBER 2015 • VOL 350 ISSUE 6266 aaa2245-3

To further justify our classification into bijec- 
tive and nonbijective complexes, we used the fact 
that the quaternary structure assigned to a pro- 
tein complex is often incorrect and does not rep- 
resent the quaternary structure in solution or 
within the cell (15, 16). Using a database of man- 
ually confirmed quaternary structure assignments 
(17), complemented by additional manual assignments
of our own, we compared error rates for
bijective and nonbijective homomers and heteromers
(Fig. 4). We found that, whereas bijective
complexes have a low rate of quaternary structure 
error (~10%), more than half of the nonbijective
structures are the result of errors. Thus, most non- 
bijective protein complex structures are not gen- 
uine examples of biological asymmetry but instead 
are due to artifacts or errors in the structure de- 
termination process. This also suggests that a 
protein complex’s nonbijective status could be 
very useful for identifying likely quaternary struc- 
ture assignment errors. 

The nonbijective complexes with uneven stoichi- 
ometry are an exception to the above, with only a 
20% quaternary structure error rate. We recently 
studied these in detail and determined several 
different structural mechanisms by which they
can form, which include varying degrees of pseudosymmetry,
steric occlusion, and subunit flexibility
leading to conformational differences between
identical subunits (18). We found that those com- 
plexes for which a structural mechanism for uneven 
stoichiometry could not be ascertained were 
mostly the result of quaternary structure assign- 
ment errors (18). 

A periodic table of protein complexes 

Analysis of the real and enumerated quaternary 
structure topologies above shows that all bijective 
heteromers can be related to simpler homomeric 
topologies. Specifically, if the different subunit types 
are grouped together as protomers, then the in- 
teractions between protomers will be equivalent 
to a bijective homomer topology or, for cases with 
no subunit repeats, to a monomer. This suggests a 
natural classification for protein complexes: in 
one dimension, by the number of their repeats, 
and within that, by their equivalent homomeric 
topologies; and in the other dimension, by the 
number of unique subunits. Figure 5 illustrates 
this periodic table of protein complexes for all 
topologies with ≤12 repeats and ≤4 unique subunits.
In this classification, complexes related to the 
equivalent homomers are contained in the same 
column of the table, thus allowing the similarities 
between different heteromeric complexes to be 
easily recognized. 
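
The two coordinates of the table can be read directly from a complex's stoichiometry. The following is a minimal sketch with hypothetical names, using r for the number of subunit repeats and s for the number of unique subunit types. It does not assign the equivalent homomeric topology that further distinguishes entries with the same number of repeats, and it returns no position for uneven stoichiometries, which fall outside the bijective table.

```python
def table_position(stoichiometry):
    """Return (r, s) for a complex with even stoichiometry, else None.

    stoichiometry: dict of subunit type -> number of copies in the complex
    r: number of subunit repeats; s: number of unique subunit types
    """
    if not stoichiometry:
        raise ValueError("stoichiometry must name at least one subunit type")
    copy_numbers = set(stoichiometry.values())
    if len(copy_numbers) != 1:
        return None  # uneven stoichiometry: no single repeat number
    r = copy_numbers.pop()
    s = len(stoichiometry)
    return r, s


print(table_position({"A": 6}))                  # homohexamer
print(table_position({"A": 2, "B": 2}))          # 2:2 heterotetramer
print(table_position({"A": 1, "B": 1, "C": 1}))  # heterotrimer, no repeats
print(table_position({"A": 2, "B": 1}))          # uneven stoichiometry
```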

Most symmetry groups are associated with a 
single homomeric topology, except dihedral groups 
for ≥6 subunits and tetrahedral groups (12 subunits),
which both have two topologies each. Higher-order 
symmetries, such as the octahedral and icosahedral 
groups, only appear for 24 or more subunits and 
can have three or more topologies each. Although 
the graph representation of the topology incorpo- 
rates similarities between binding surfaces based 
on the identities of interacting residues, it does not 
require any nonlocal geometric information. Our 
graph representations therefore inherently include 
the symmetry group information of a complex, 
and they constitute the first network representa- 
tion of complexes to do this. 

Figure S4 shows the frequencies of each equiv- 
alent homomer symmetry group for complexes with 
varying numbers of unique subunits. Complexes 
with different numbers of unique subunits have 
similar distributions. Thus, homomers and hetero- 
mers populate the horizontal axis of the periodic 
table in a similar manner, although complexes with 
more unique subunits do tend to have fewer repeats. 

The regions of the periodic table that corre- 
spond to higher numbers of repeats and subunits 
are sparsely populated. This can be attributed to 
two factors. First, there is a considerable bias 
among structurally characterized protein com- 
plexes toward those with smaller numbers of 
unique subunits, whereas evidence suggests that 
protein complexes in vivo will tend to have more 
unique components (19, 20). Second, as shown in 
fig. S4, topologies toward the right of the periodic 
table tend to be less common, which suggests that 
cyclic or dihedral complexes with more repeated 
subunits may be less stable or more difficult to 
evolve. These regions can also be expected to be 
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Bijectivity 



bijective homomer 

one sequence, one topological environment 



non-bijective homomer 

one sequence, more than one topological 
environment 



Occurrence Error rate** 
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bijective heteromer 

multiple sequences, which map bijectively (i.e. one-to-one) 
to topological environments 



non-bijective heteromer with uneven stoichiometry 

multiple sequences, which do not map bijectively (i.e. one- 
to-one) to topological environments, and do not all appear 
an equal number of times. 



non-bijective heteromer with even stoichiometry 

multiple sequences, which do not map bijectively (i.e. one- 
to-one) to topological environments, but do all appear an 
equal number of times. 



91.7% 
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20.4% (20.1% 
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In this example the central protein forms two different interfaces with the two outer proteins due to the 
inherent asymmetry of proteins. The topological environments of the outer proteins therefore differ. 

** values in brackets exclude 'probably yes' and 'probably no' error assignments in PiQSi from the analysis 



Fig. 4. Frequencies of protein complex types and 
their quaternary structure assignment error rates. 

Among nonbijective heteromers, we further distinguished
between those with even stoichiometry and those with
uneven stoichiometry. The former are much more likely to
be the result of quaternary structure assignment errors. The latter are more likely to represent a biologically relevant quaternary structure. In the last column, we give
alternative error rates in brackets that exclude the PiQSi (17) error assignments “probably yes” and “probably no” from the analysis. These error rates follow the same
pattern for nonbijective heteromers of both even and uneven stoichiometries. 
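
The five complex types distinguished in Fig. 4 can be read as a small decision procedure. The following is a minimal sketch under stated assumptions: the input format (one `(sequence, environment)` label pair per subunit) and the function name `classify_complex` are hypothetical and are not taken from the original analysis.

```python
from collections import Counter, defaultdict

def classify_complex(subunits):
    """Classify a complex from (sequence_id, environment_id) pairs, one per subunit.

    Hypothetical input format: both labels are arbitrary hashable identifiers.
    """
    envs_per_seq = defaultdict(set)
    seqs_per_env = defaultdict(set)
    for seq, env in subunits:
        envs_per_seq[seq].add(env)
        seqs_per_env[env].add(seq)

    # Bijective: sequences and topological environments map one-to-one.
    bijective = (all(len(e) == 1 for e in envs_per_seq.values())
                 and all(len(s) == 1 for s in seqs_per_env.values()))

    if len(envs_per_seq) == 1:
        return "bijective homomer" if bijective else "non-bijective homomer"
    if bijective:
        return "bijective heteromer"

    # Even stoichiometry: every sequence appears the same number of times.
    counts = Counter(seq for seq, _ in subunits)
    kind = "even" if len(set(counts.values())) == 1 else "uneven"
    return f"non-bijective heteromer with {kind} stoichiometry"
```

For instance, a central subunit B flanked by two copies of A in different environments, `[("A", "e1"), ("B", "e2"), ("A", "e3")]`, would be classified as a non-bijective heteromer with uneven stoichiometry.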



filled in coming years, at least to a certain extent. 
Figure 5A shows the rate at which new topologies 
have been discovered: roughly four per year for 
the past 20 years, with no signs of slowing. To 
illustrate the space of possible topologies, the num- 
ber of discovered topologies versus the number 
possible as determined through exhaustive enu- 
meration is shown in each cell of the periodic 
table (an example is shown in Fig. 5B). 

This table is not “periodic” in the same sense 
as the periodic table of the elements, because it is 
in principle open-ended, as opposed to periodic 
with respect to atomic number. There are no 
theoretical limitations to quaternary structure 
topology space in either dimension, although the 
vast majority of known structures can be placed 
on the table in Fig. 5. In fig. S5, we have provided 
an expanded version of the periodic table, where 
complexes with up to 14 unique subunits and 48 
subunit repeats can be visualized. We believe 
that the analogy to the periodic table of the elem- 
ents is useful, because it provides a means of or- 
ganizing quaternary structure topologies and 
visualizing similarities. Furthermore, just as the 
periodic table of the elements has successfully 
predicted many new chemical elements, our pe- 
riodic table of proteins has considerable predic- 
tive power by revealing the regions of quaternary 
structure space that remain to be populated. 



We showed above that the majority of non- 
bijective complexes are the result of quaternary 
structure assignment errors. The exception to 
this is complexes with uneven stoichiometry, most 
of which represent genuine cases of biological 
asymmetry. Therefore, we sought to reconcile un- 
even stoichiometry with our periodic table. We 
found that if we consider the periodic table at the 
level of local topological environments, rather 
than at the level of subunits, then two sequence- 
identical subunits can play different roles within 
the graph representing the complex. Examination 
of the topologies of nonbijective complexes revealed 
that many of them were equivalent to the 
same symmetric homomer topologies observed 
for the bijective periodic table. Figure S6 illustrates 
this with a periodic table made for nonbijective 
heteromers with 2:1 subunit stoichiometry. For 
these cases, the 2:1 protomer can be considered 
analogous to a heterotrimer with three unique 
subunits. The only difference between 2:1 het- 
eromers here and 1:1:1 heteromers from the main 
periodic table (the third row in Fig. 5) is that in 
the 2:1 heteromers, sequence-identical subunits 
sometimes can still form isologous interfaces, 
despite existing in different local environments. 
Thus, the results of our quaternary structure enu- 
meration can be easily applied to complexes with 
uneven stoichiometry, if the repeated subunits 



from the protomer are considered to be different 
subunit types in our enumeration model. 

Predicting likely yet unobserved 
quaternary structure topologies 

The exhaustive enumeration allows us to deter- 
mine what quaternary structure topologies are 
possible, but it does not tell us which are most 
likely or should be most abundant in nature. To 
address this, we adapted our enumeration pro- 
cedure to produce topologies according to the 
observed distribution in the periodic table. We 
know that each cell on the periodic table can be 
defined by a specific set of assembly steps needed 
to build the topologies within that cell. Combin- 
ing the steps in different ways can produce all the 
topologies compatible with a given cell. Therefore, 
we sampled cells of the periodic table according to 
the observed distribution in real complexes, each 
time randomly combining the assembly steps associated 
with each cell. This was repeated 3 × 10⁷ 
times (details are provided in the Methods). 

All of the quaternary structure topologies pres- 
ent on the periodic table were observed at least 
once in our calculations. In addition to the pre- 
viously observed quaternary structure topologies, 
our model also predicted 579 topologies that 
were not seen in any of the complexes in our data 
set. To independently validate this result, we 
Fig. 5. Periodic table of protein complexes. All bijective protein complex 
topologies can be arranged according to the number of different subunit types 
(s) and the number of times these subunits are repeated (r). Isologous inter- 
faces between the same subunits (dihedral interfaces) are shown in green, and 
heterologous interfaces between subunits of the same types (cyclic interfaces) 
are shown in blue. Heteromeric interfaces are shown in red, apart from those 
that correspond to a symmetric dimerization (yellow) or to higher-order cy- 
clization (purple). The topologies in the s = 1 row are the equivalent homomers 
of the heteromeric structures in the s > 1 rows. To clarify this equivalence, 
subunits in the heteromers are grouped according to the repeated subcom- 
plexes. In addition, the yellow and purple interfaces of the heteromeric complexes 



highlight interfaces that are dihedral (green) and cyclic (blue) in the equivalent 
homomers. The ratio in the bottom right of each cell indicates the number of 
topologies that have been observed and the total number of possible topologies 
of this type. The table shown here is an excerpt (s < 5; r < 13) of the full table. An 
interactive version of this table with information on the structures represented 
by each topology can be found at http://www.periodicproteincomplexes.org/. 
[Inset (A)] Number of discovered topologies as a function of time, which has 
been steadily increasing at a rate of about four topologies per year for the 
past two decades. [Inset (B)] An illustration of observed topologies versus all 
possible topologies with six repeats and two subunits (r = 6; s = 2). Three of the 
possible five topologies have been observed thus far. 



compiled an extended set of heteromeric com- 
plexes not present in our original data set because 
they were published more recently, because they 
were determined with electron microscopy (which 
we did not initially include), or because they were 
originally excluded based on structural criteria 
(see Methods). 

The extended set of heteromers contained 53 
different quaternary structure topologies, 14 of 
which were not present in the main data set. 
These 14 tended to be among the most highly 
predicted topologies in our model. For example, 
six of them were observed among the top 20 



most likely predicted topologies (Fig. 6), out of a 
total of 579 predicted (P = 2 × 10⁻⁶; Fisher’s exact 
test). Figure S7 illustrates how the observed to- 
pologies cluster within the most highly predicted 
rankings, thus supporting the predictive utility of 
our model. 
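
The enrichment reported above can be checked with a short calculation. The sketch below assumes the test is one-sided, in which case Fisher's exact test reduces to the upper tail of a hypergeometric distribution; the function name and the framing of the counts are illustrative assumptions, not the authors' code.

```python
from math import comb

def hypergeom_upper_tail(population, marked, draws, hits):
    """P(X >= hits) when `draws` items are drawn without replacement from
    `population` items, of which `marked` are special."""
    total = comb(population, draws)
    upper = min(marked, draws)
    return sum(
        comb(marked, k) * comb(population - marked, draws - k)
        for k in range(hits, upper + 1)
    ) / total

# 579 predicted topologies, 20 in the top-ranked group,
# 14 newly observed, 6 of which fall in the top 20.
p_value = hypergeom_upper_tail(population=579, marked=20, draws=14, hits=6)
print(f"{p_value:.1e}")
```

Under these assumptions the result should be of the same order as the reported P = 2 × 10⁻⁶.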

We also used a complementary approach for 
the prediction of the relative abundances of to- 
pologies within a given cell, which makes fewer 
assumptions but also yields less specific predic- 
tions. We considered the number of distributed 
interfaces (that is, single interfaces that are spread 
across two subunits) and the number of topological 



equivalents (marked by red crosses in fig. S8) of a 
given interface distribution. We compared to- 
pologies pairwise within cells of the periodic table 
with <4 unique subunits and <12 subunit repeats, 
and we counted the instances in which topology A 
had fewer distributed interfaces and equal or 
more topological equivalents than topology B, 
or fewer or equal distributed interfaces and more 
topological equivalents. Out of the 30 such instances, 
topology A was more abundant than topology B 
21 times (70% of instances). This is because 
distributed interfaces restrict the order in which 
evolutionary steps can happen, making topologies 
with more such interfaces rarer. Larger numbers 
of topological equivalents, on the other hand, 
make topologies more common, because there are 
more ways in which such complexes can evolve. 
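
The pairwise comparison rule described above can be written compactly. This is an illustrative sketch only: the dictionary keys (`distributed`, `equivalents`, `count`) and the toy cell are made up for the example and are not data from the study.

```python
from itertools import permutations

def favored(a, b):
    """True if topology `a` is expected to be more abundant than `b`.

    Each topology is a dict with hypothetical keys:
    'distributed' (number of distributed interfaces) and
    'equivalents' (number of topological equivalents).
    """
    fewer_dist = a["distributed"] < b["distributed"]
    more_equiv = a["equivalents"] > b["equivalents"]
    return ((fewer_dist and a["equivalents"] >= b["equivalents"])
            or (a["distributed"] <= b["distributed"] and more_equiv))

def score_cell(topologies):
    """Count comparable ordered pairs in one cell and how many agree
    with the observed abundances ('count')."""
    comparable = agree = 0
    for a, b in permutations(topologies, 2):
        if favored(a, b):
            comparable += 1
            agree += a["count"] > b["count"]
    return comparable, agree

# Made-up toy cell, not data from the study.
cell = [
    {"distributed": 0, "equivalents": 2, "count": 40},
    {"distributed": 1, "equivalents": 2, "count": 12},
    {"distributed": 1, "equivalents": 1, "count": 15},
]
```

Summing the two counts returned by `score_cell` over all cells would give totals analogous to the 30 comparable instances and 21 agreements quoted in the text.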

Finally, to further validate our predictive mod- 
el, we compared the predicted frequencies of 
heteromeric quaternary structure topologies with 
those observed in both the main and extended 
data sets (fig. S9). Overall, the correlations are 
strong, with the predictions recapitulating the 
observed frequencies. Although the predictions 
are partially fitted to the frequencies of topologies 
observed in the main data set, the high correlation 
with the extended data set provides strong 
independent validation of our model. 

Conclusion 

In this study, we have shown that the assembly 
of protein complexes is dominated by three main 
transition types, which in combination can explain 
most observed quaternary structure topologies. 
This also allows a natural organization of protein 
complexes in a periodic table, in which hetero- 
meric protein complexes are grouped according 
to their equivalent homomeric quaternary struc- 
ture topologies. The periodic table illustrates 
both the variety of observed protein complexes 
and the space of possible topologies through ex- 
haustive enumeration, analogous to previous 
strategies of investigating network topologies 
(21, 22). Given that new topologies have been 
discovered at a fairly constant rate of four per 
year over the past two decades (Fig. 5A), we can 
expect new additions to the unfilled or partially 
filled cells of the periodic table in the near future. 
These unfilled or partially filled cells constrain 
the total space of expected protein complexes, 
similar to the proposed upper bound of 10,000 
total types of interacting domain pairs (23). 

A major practical application of the periodic 
table will be in predicting and modeling the quater- 
nary structure of protein complexes. Specific- 
ally, our results show that bijective quaternary 
structure topologies are far more likely to occur 
than nonbijective topologies, despite the fact that 
there are far more possible nonbijective topologies. 
We also provide predictions for the relative like- 
lihoods of different bijective topologies. This 
knowledge can inform the interpretation of high- 
throughput interaction experiments (24) or 
structure-based interaction predictions (25) by 
highlighting the quaternary structure topologies 
that are possible and most likely to occur. Homol- 
ogy information can aid these quaternary struc- 
ture predictions and give further insight into the 
evolution and assembly of complexes, because 
subcomplexes often arise as evolutionary precur- 
sors and assembly intermediates (12). Similarly, 
the periodic table can tell us which evolutionary 
precursor topologies are likely to have given rise 
to a specific complex. The periodic table can also 
provide constraints for multi-subunit docking or 
modeling, both on the relative arrangements of 
subunits and on the overall complex symmetry 
(26-29). Such constraints could be further inte- 
grated into hybrid methods that combine dif- 
ferent experimental measurements (30), such 



[Fig. 6 panel: Top 20 predicted topologies. Out of 579 predicted topologies, a total of 14 are observed in the extended data. Six of these observed topologies are among the top 20 predicted.]




Fig. 6. The top 20 most likely 
quaternary structure topologies 
from our model that are not 
observed in the main data set. 

Of these top 20, six are 
observed in the extended data 
set, validating the power of the 
model (P = 2 × 10⁻⁶). The other 14 
topologies in the top 20 are also 
expected to occur relatively fre- 
quently in nature and thus to be 
observed soon in experimentally 
determined structures. The 
distribution of all new topologies 
observed in the extended data set 
compared with the expected fre- 
quencies of all predicted topologies 
is shown in fig. S7. 



as electrospray (31) or cross-linking (32) mass 
spectrometry. 

This work could be of substantial utility in the 
bioengineering of protein complexes. The self- 
assembly formalism introduced here allows spe- 
cification of exactly which essential interfaces 
would need to be engineered to form a protein 
complex of a given topology. This could facilitate 
de novo engineering of oligomeric assemblies 
(33-35) and allow for directed modulation of ex- 



isting quaternary structures, stepping either across 
or down the periodic table in an incremental 
manner. 

Despite its strong predictive power, the basic 
periodic table model does not account for ~8% of 
known protein complex structures. More than half 
of these exceptions arise as a result of quaternary 
structure assignment errors. A benefit of this ap- 
proach is that it highlights likely quaternary struc- 
ture misassignments, particularly by identifying 
nonbijective complexes with even subunit stoichiometry. 
However, this still leaves ~4% of known 
structures that are correct but are not compatible 
with the periodic table. 

Although nonbijective complexes are possible, 
they are rare, and this should be given consider- 
ation in any protein modeling or engineering at- 
tempts. Related to this, it would be particularly 
interesting to see whether chaperones are more 
frequently involved in the assembly of nonbi- 
jective complexes to stabilize the required asym- 
metric transitions. To model the nonbijective 
protein complexes, additional assembly steps in- 
volving fractional and nonstoichiometric transi- 
tions are needed. However, as we showed, this 
would also greatly expand the number of possible 
quaternary structure topologies. Therefore, we 
consider the periodic table in its current im- 
plementation to be a reasonable compromise, 
allowing the vast majority of existing quaternary 
structure topologies to be explained and the most 
likely unobserved topologies to be predicted. 

Methods 

Mass spectrometry experiments 

The complexes were kindly donated as follows: 
Saccharomyces cerevisiae SAGA deubiquitinating 
module [PDB accession number (ID), 3MHH; 
C. Wolberger, Johns Hopkins University School of 
Medicine]; Thermus thermophilus MglA/MglB 
complex (PDB ID, 3T1Q; A. Wittinghofer, Max 
Planck Institute for Molecular Physiology); Geobacillus 
stearothermophilus PDH E1 subunit 
(PDB ID, 3DVA; B. Luisi, University of Cambridge); 
Streptococcus pyogenes toxin-antitoxin complex 
(PDB ID, 3Q8X; A. Meinhart, Max Planck Institute 
for Medical Research); Saccharomyces cerevisiae 
phosphofructokinase (PDB ID, 3O8O; T. Schöneberg, 
University of Leipzig); Ruegeria pomeroyi propionyl-CoA 
carboxylase (PDB ID, 3N6R; L. Tong, Columbia 
University); Homo sapiens MSL1-MSL2 complex 
(PDB ID, 4B7Y; J. Kadlec, European Molecular 
Biology Laboratory); Synechococcus elongatus acetylglutamate 
kinase/PII complex (PDB ID, 2V5H; 
V. Rubio, Instituto de Biomedicina de Valencia); 
and Pseudomonas aeruginosa 3-methylcrotonyl-CoA 
carboxylase (PDB IDs, 3U9T and 3U9S; 
L. Tong, Columbia University). 

The nano-electrospray ionization mass spec- 
trometry experiments on complex disassembly 
and reassembly were performed as previously 
described using quadrupole time-of-flight mass 
spectrometers modified for high-mass/charge 
ratio operation (11). A list of all (sub)complexes 
observed under various experimental conditions 
is provided in table S1. 

Protein complex data sets 

The full set of protein x-ray crystal structures was 
taken from the PDB on 8 August 2012. Only poly- 
peptide chains with at least 30 residues were con- 
sidered. Backbone-only models were ignored, as 
were structures containing nucleic acids or >10% 
non-water heteroatoms. Heteromeric protein com- 
plexes formed by polypeptide cleavage were also 
ignored. In addition, this approach had the effect 
of removing complexes with protein chains that 



lack unique “db_id” sequence identifiers in the 
PDB. Complexes with >59 subunits or that were 
split over multiple PDB entries were excluded. In 
total, the final data set contained 30,469 mono- 
mers, 28,935 homomers, and 5543 heteromers. 
Manual quaternary structure assignments came 
from PiQSi entries with errors assigned as “prob- 
ably yes” or “probably no” (17) and from additional 
manual searching of the literature. 

The size of the interface between each pair of 
subunits from all protein complexes was cal- 
culated with AREAIMOL (36). For each complex, 
all interfaces of the same type were identified by 
calculating the correlation between atom-specific 
buried surface areas for each pair of interfaces. 
Only interfaces >200 Å² were considered. Two 
interfaces were considered to be of the same 
type if the Pearson correlation between the buried 
surface areas (in terms of equivalent amino acids) 
was >0.7. Interface sizes were averaged over all 
interfaces of the same type. Similarly, homomeric 
interfaces were classified as isologous if the cor- 
relation between the residue-specific buried sur- 
face area for each subunit was >0.7. 
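
The interface-matching criterion can be illustrated as follows. This is a sketch under assumptions: the buried surface areas are taken to be per-residue values listed over the same equivalent residues in the same order, the interface size is approximated as their sum, and both function names are hypothetical. The original work computed areas with AREAIMOL, which is not reproduced here.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # assumption: treat a constant profile as uncorrelated
    return cov / sqrt(var_x * var_y)

def same_interface_type(bsa_1, bsa_2, min_area=200.0, min_r=0.7):
    """Compare two interfaces given per-residue buried surface areas (in Å²)."""
    if sum(bsa_1) <= min_area or sum(bsa_2) <= min_area:
        return False  # interface too small to be considered
    return pearson(bsa_1, bsa_2) > min_r
```

The thresholds default to the values stated in the text (200 Å² and 0.7) but are left as parameters so that the sensitivity of the classification can be explored.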

The extended set of quaternary structure to- 
pologies, used to validate our predictive model, 
was taken from a more recent set of protein com- 
plex structures from the PDB on 16 December 
2014. In addition to the crystal structures used in 
the main set, electron microscopy structures were 
also considered here. The constraints on the main 
data set were loosened, so that complexes formed 
via cleavage, as well as complexes containing 
nucleic acids or other heteroatoms, were also in- 
cluded (although only protein chains were con- 
sidered). In total, this extended set possessed 
4214 bijective heteromers (with <4 unique sub- 
units and <12 subunit repeats) not present in 
the main data set. Within this set, there were 
53 different quaternary structure topologies, 14 
of which are new. 

For certain analyses, we used nonredundant 
subsets of the full and extended data sets. Proteins 
were filtered for redundancy at the 50% sequence 
identity level, essentially following a previous ap- 
proach (18, 20). The nonredundant sets were used 
for the histogram in fig. S4, for the comparison 
between observed and predicted results in fig. S9, 
and for fitting the expected distribution of perio- 
dic table quaternary structures in our model. 

Both the main set and the extended set of 
quaternary structure topologies used in this study 
are provided in table S2, in the form of pairwise 
interfaces formed between subunits. 

Determination of assembly pathways 

All of the mass spectrometry and literature- 
identified (dis)assembly pathways involved pro- 
tein complexes of known structure for which the 
subunit composition of at least one subcomplex 
intermediate could be identified. To complement 
this, we also performed a structural analysis in 
which we identified similar protein complexes 
with different quaternary structures. Starting 
from the full set of homomeric and heteromeric 
complexes in our main data set, we searched for 
complexes for which another complex could be 



considered as a subset of the full complex. For 
example, a homodimer was considered to be a 
subset of a homotetramer if the subunits shared 
>90% sequence identity. All of these subset com- 
plexes were considered to be similar to the exper- 
imentally identified subcomplexes for the purpose 
of defining assembly transitions. All of the exper- 
imentally identified subcomplexes and structural 
subsets are provided in table S3. 

Although we observed the subcomplexes formed 
during (dis)assembly, we did not directly observe 
the assembly or disassembly steps that occur in 
solution. However, we inferred these from the sub- 
complexes identified. For every subcomplex and 
full complex, we identified the largest subcomplex 
that can be considered to be a subset of that (sub) 
complex. If no subcomplex was observed, then 
(dis)assembly was assumed to occur via free 
monomers. We then assigned the transition be- 
tween the two states into one of the five cat- 
egories from Fig. 2. All (dis)assembly transitions 
are provided in table S4. 
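
The inference of the preceding state for each (sub)complex can be sketched with multisets of subunit types. The representation (a `Counter` of subunit labels per complex) and the function names are assumptions made for illustration; assigning the resulting transition to one of the categories from Fig. 2 is not attempted here.

```python
from collections import Counter

def is_subset(small, big):
    """True if multiset `small` is a proper sub-multiset of `big`."""
    return small != big and all(big[k] >= n for k, n in small.items())

def largest_precursor(target, observed):
    """Return the largest observed subcomplex contained in `target`,
    or None if (dis)assembly is assumed to proceed via free monomers."""
    candidates = [c for c in observed if is_subset(c, target)]
    if not candidates:
        return None
    return max(candidates, key=lambda c: sum(c.values()))

# Hypothetical A2B2 complex with two observed subcomplexes.
full = Counter("AABB")
observed = [Counter("AB"), Counter("AAB"), Counter("AABB")]
```

In this toy case, `largest_precursor(full, observed)` selects the AAB subcomplex, so the inferred transition would be from AAB to AABB.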

Interface cutting procedure 

To distinguish subunit interfaces that are cir- 
cumstantial from those that are essential to the 
self-assembly process, we removed interfaces from 
the weighted subunit contact graph of a given 
complex, in increasing order of interface size. We 
skipped interfaces if cutting them would result 
in the complex becoming disconnected, and we 
stopped when no further interfaces could be cut. 

This procedure does not necessarily result in a 
tree-like graph, because the same interface may 
appear several times in the same complex. For 
example, a cyclic ring will not be cut further, 
because all interfaces, which are of the same size, 
would have to be cut at once. 
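
A minimal sketch of this cutting procedure is given below. It assumes that each interface type carries a single (averaged) size and that all copies of a type are removed together or not at all, as implied by the cyclic ring example; the data layout and function names are hypothetical.

```python
from collections import defaultdict

def is_connected(nodes, edges):
    """Depth-first search connectivity check on an undirected graph."""
    nodes = list(nodes)
    if not nodes:
        return True
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, stack = {nodes[0]}, [nodes[0]]
    while stack:
        for nxt in adj[stack.pop()] - seen:
            seen.add(nxt)
            stack.append(nxt)
    return len(seen) == len(nodes)

def cut_interfaces(nodes, interfaces):
    """Remove interface types from smallest to largest while keeping
    the complex connected.

    `interfaces` maps a hypothetical interface-type label to
    (size, [(u, v), ...]); all copies of one type are cut together.
    """
    kept = dict(interfaces)
    for label in sorted(interfaces, key=lambda k: interfaces[k][0]):
        trial = {k: v for k, v in kept.items() if k != label}
        remaining = [e for _, edges in trial.values() for e in edges]
        if is_connected(nodes, remaining):
            kept = trial  # circumstantial interface: safe to cut
    return kept
```

A single pass in increasing size order is sufficient in this sketch, because an interface type that cannot be cut at one point cannot become cuttable after further interfaces have been removed.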

The advantages of this approach are that (i) it 
greatly simplifies the number of possible topol- 
ogies and unifies similar topologies that would be 
treated differently if all interfaces were taken into 
account; (ii) it is less arbitrary than conventional 
thresholds of interface size, and therefore it is a 
more fundamental description of a complex; and 
(iii) symmetry emerges directly from the graph 
topology through the number of distinct interfaces 
of a subunit. 

Topological enumeration procedure 

The combinations of different interface types that 
can be present in a homomeric structure include 
one symmetric interface (C₂ symmetry), one asymmetric 
interface (cyclic symmetry with more than 
two subunits), one symmetric and one asymmetric 
interface (dihedral or tetrahedral symmetry), two 
symmetric interfaces (dihedral symmetry), and 
two asymmetric interfaces (only in tetrahedral 
symmetry). Larger numbers of interface types 
are not possible in homomers, because of the 
constraints placed on the symmetry of the com- 
plex by these interface types. 

As explained above, any heteromer that is 
formed using a combination of cyclization, di- 
merization, and subunit addition can be repre- 
sented as a homomer of multiple copies of the 
same heteromeric module, in which each subunit 
type appears exactly once. We therefore enumerated 
all possible heteromeric topologies by carrying out 
the following procedure (illustrated in fig. S8). 

1) We enumerated all trees of s unique sub- 
units. These are the heteromeric modules, mul- 
tiple copies of which are joined together into 
“homomers.” The reason for restricting ourselves 
to trees instead of all possible graphs is that we 
aimed to consider only the most important 
interfaces in the original complexes, by using the 
interface cutting procedure outlined above. This 
will always lead to tree-like structures as the re- 
peated heteromeric modules. 

2) For each of the five possible combinations of 
interface types in homomers discussed above, we 
considered all topologically distinct ways in which 
the interfaces can be distributed across the set of 
subunits and pairs of subunits of the tree. Here, 
topologically distinct means that we cannot con- 
vert two distributions of interfaces into each other 
by only using symmetry operations of the tree. 

3) We constructed the topologies of the com- 
plexes from these distributions of interfaces across 
the tree. For this, we also needed to consider the 
possible number of repetitions for each cyclic in- 
terface (for example, in the case of the ring-like 
complexes of varying size). In complexes with 12 
or more subunits, there can be more than one pair 
of divisors with at least one even divisor (e.g., in the 
case of 12 subunits, three and four, and two and 
six), which leads to several possible topologies for 
the same total number of subunits (for example, D₆ 
and T in the r = 12 column of the s = 1 row of the 
periodic table). 

4) We distinguished isomorphic topologies. In 
some cases, different distributions of interfaces 
in the enumeration procedure led to isomorphic 
topologies. These were easily identified using an 
isomorphism check, giving us the final enumera- 
tion of topologies. 
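
The divisor argument in step 3 can be made concrete with a short calculation. The sketch below reflects one reading of the text: it lists factor pairs of the repeat number r in which both factors are at least 2 and at least one is even. Excluding the trivial factor 1 is an assumption, and the function name is hypothetical.

```python
def symmetry_factorizations(r):
    """Pairs (a, b) with a * b == r, 2 <= a <= b, and at least one even factor."""
    pairs = []
    a = 2
    while a * a <= r:
        if r % a == 0:
            b = r // a
            if a % 2 == 0 or b % 2 == 0:
                pairs.append((a, b))
        a += 1
    return pairs
```

Under this reading, `symmetry_factorizations(12)` yields the two pairs mentioned in the text (two and six, three and four), whereas no smaller r has more than one such pair.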

The numbers in the bottom right of each cell 
in the periodic table (Fig. 5) give the observed and 
total numbers of different topologies for that par- 
ticular symmetry group and number of subunits s. 

Enumeration of all possible bijective and 
nonbijective topologies 

We enumerated all possible bijective and non- 
bijective topologies with 2:2 stoichiometry, and 
all possible bijective and nonbijective topologies 
with 3:3 stoichiometry and up to six interfaces. 
We did this by considering all possible four-node 
graphs with three to six edges (2:2), shown in fig. 
S2, and all six-node graphs with five or six edges 
(3:3), shown in fig. S3. For these, we considered 
all distributions of equal numbers of two node 
colors (corresponding to the two protein species). 
In addition, for all edges between nodes of the 
same color, we considered two possibilities, corre- 
sponding to isologous and heterologous interfaces. 
Edges can thus have three colorings (hetero- 
meric, homomeric-isologous, and homomeric- 
heterologous). We also considered different 
interface sizes by considering all possible relative 
size ranks (including equal ranks) of the different 
edges and subsequently cutting interfaces ac- 
cording to the same procedure followed in the 



periodic table. An isomorphism check (which in- 
cludes the colorings of nodes and edges) was 
then used to identify distinct topologies in this 
enumeration. Two additional constraints were 
(i) that an equal interface size can appear more 
than once only for one type of subunit pair, and 
(ii) that the same interface size can only appear 
once on each subunit for homomeric-isologous 
and heteromeric interfaces and once or twice for 
homomeric-heterologous interfaces. 
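
The starting point of this enumeration, the set of distinct four-node graphs, can be generated by brute force. The sketch below is illustrative only: it assumes that only connected graphs are relevant (a complex must hold together), it ignores node and edge colorings as well as interface size ranks, and it uses a simple canonical form over all relabelings rather than a dedicated isomorphism library.

```python
from itertools import combinations, permutations

def canonical(n, edges):
    """Smallest sorted edge list over all relabelings of the n nodes."""
    best = None
    for perm in permutations(range(n)):
        relabeled = tuple(sorted(
            tuple(sorted((perm[u], perm[v]))) for u, v in edges))
        if best is None or relabeled < best:
            best = relabeled
    return best

def connected(n, edges):
    """Check connectivity by growing the component that contains node 0."""
    seen = {0}
    changed = True
    while changed:
        changed = False
        for u, v in edges:
            if (u in seen) != (v in seen):
                seen |= {u, v}
                changed = True
    return len(seen) == n

def distinct_connected_graphs(n, min_edges, max_edges):
    """All connected n-node graphs with the given edge counts, up to isomorphism."""
    all_edges = list(combinations(range(n), 2))
    found = set()
    for m in range(min_edges, max_edges + 1):
        for edges in combinations(all_edges, m):
            if connected(n, edges):
                found.add(canonical(n, edges))
    return sorted(found, key=len)

graphs = distinct_connected_graphs(4, 3, 6)
```

Extending the canonical form to include node colors (protein species) and edge colors (heteromeric, homomeric-isologous, homomeric-heterologous) would be the natural next step toward the colored isomorphism check described in the text.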

Prediction of expected frequencies of 
quaternary structure topologies 

To predict the expected frequencies of quaternary 
structure topologies, we attempted to recapitulate 
the observed distribution of complexes within cells 
on the periodic table, considering complexes with 
<12 subunit repeats and <4 unique subunits. This 
prediction procedure can be divided into three parts. 

1) Selecting a cell from the periodic table. We 
first randomly selected a structure from the non- 
redundant set of complexes and monomers. The 
row of the periodic table (the number of unique 
subunits s ) was directly taken from this random- 
ly selected structure. However, the column of the 
periodic table was not taken directly from this 
structure. This is because the sampling of cells on 
the periodic table is sparse for the lower rows, 
and thus we would have missed cells with no 
current structures. Instead, each structure was 
classified into one of three groups: first column 
(monomeric or no repeated subunits), cyclic 
(including C₂), or dihedral or tetrahedral. Then, 
another structure was randomly selected from 
those in the first row of the table (a monomer or 
homomer) that belong to the same group. This 
structure was used to define the column of the 
periodic table. Thus, the distribution of homo- 
mers defines the distribution of predicted het- 
eromers in the horizontal axis, with a correction 
for the fact that complexes with more subunit 
repeats tend to be less likely to have cyclic or 
dihedral subunit repeats. 

2) Defining the assembly steps. Each cell of the 
periodic table is associated with a specific set of 
subunit addition, dimerization, and/or cycliza- 
tion assembly steps required to get to it from a 
monomer. Therefore, to generate a random quat- 
ernary structure topology that was compatible 
with a given cell, we first randomized the order 
of the assembly steps. There are two exceptions 
to this: (i) for dihedral topologies where the 
homomer has at least six subunits and only 
isologous interfaces (e.g., a trimer of dimers 
rather than a dimer of trimers), the dimerization 
step must occur before the cyclization step; and 
(ii) for tetrahedral complexes, the cyclic trimerization 
must occur before the tetramerization. 

3) Constructing a quaternary structure topol- 
ogy, given a defined set of assembly steps. A new 
interface was added to the quaternary structure 
topology for each assembly step. When the sub- 
complex is heteromeric, there are multiple ways 
an interface could be formed. In these cases, the 
subunit(s) to be involved were randomly se- 
lected. For example, if a dimerization step was 
applied to an A-B subcomplex, then a new isol- 



ogous interface could be formed between two A 
subunits or two B subunits, or a pair of identical 
heteromeric interfaces could be formed between 
each pair of A and B subunits. In the latter case, 
the isologous interface is distributed across two 
subunits. In fig. S10, we illustrate the random 
construction process of the quaternary structure 
topology for a single cell of the periodic table. 
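
The choice made in a single dimerization step (part 3 above) can be sketched as follows. The example is hypothetical: it assumes that a distributed interface may be placed across any pair of subunit types in the subcomplex and that all options are equally likely, neither of which is stated in the text. The labels reuse the shorthand in which `d` denotes a dihedral (isologous) interface.

```python
import random
from itertools import combinations

def dimerization_options(subunit_types):
    """Ways to place one new dihedral (isologous) interface on a
    heteromeric subcomplex.

    Returns shorthand labels: 'dA' for an A-A interface, 'dAB' for an
    interface distributed across subunits A and B.
    """
    single = [f"d{s}" for s in subunit_types]
    distributed = [f"d{a}{b}" for a, b in combinations(subunit_types, 2)]
    return single + distributed

def random_dimerization(subunit_types, rng):
    """Pick one option uniformly at random (uniform choice is an assumption)."""
    return rng.choice(dimerization_options(subunit_types))

rng = random.Random(0)  # fixed seed so the sketch is reproducible
choice = random_dimerization(["A", "B"], rng)
```

For an A-B subcomplex this gives the three possibilities described in the text: an interface between two A subunits, between two B subunits, or distributed across A and B.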

All predicted quaternary structure topologies 
are provided in table S5. 

Enumeration-based shorthand notation 
of protein complex topologies 

The representation of interface distributions 
across the subunits, as shown in Fig. 3 and fig. 
S9, allows for a natural shorthand notation of 
topologies. The subunits are labeled A, B, C, etc., 
and cyclic, dihedral, and heteromeric interfaces 
are denoted c, d, and h. The location of an in- 
terface follows the type, and in the case of dis- 
tributed interfaces, both subunits are given. A 
cyclic homomer therefore is cA, the first cyclic 
structure with two subunits in Fig. 3 is cA hAB, 
and the last structure with three subunits in the 
same figure is dAC hAB hBC. 
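
A small parser shows how this shorthand could be read programmatically. It is a sketch that assumes single-letter, uppercase subunit labels and whitespace-separated interface tokens, as in the examples above; the function name is hypothetical.

```python
def parse_topology(notation):
    """Parse shorthand such as 'dAC hAB hBC' into (interface_type, subunits) tuples."""
    names = {"c": "cyclic", "d": "dihedral", "h": "heteromeric"}
    interfaces = []
    for token in notation.split():
        kind, subunits = token[0], token[1:]
        if kind not in names or not subunits or not subunits.isupper():
            raise ValueError(f"unrecognized token: {token!r}")
        interfaces.append((names[kind], tuple(subunits)))
    return interfaces
```

A token with two subunit labels after `c` or `d` corresponds to a distributed interface, and a token with one label corresponds to an interface between copies of the same subunit.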
COGNITIVE SCIENCE 

Human-level concept learning 
through probabilistic 
program induction 

Brenden M. Lake,1* Ruslan Salakhutdinov,2 Joshua B. Tenenbaum3 

People learning new concepts can often generalize successfully from just a single example, 
yet machine learning algorithms typically require tens or hundreds of examples to 
perform with similar accuracy. People can also use learned concepts in richer ways than 
conventional algorithms (for action, imagination, and explanation). We present a 
computational model that captures these human learning abilities for a large class of 
simple visual concepts: handwritten characters from the world’s alphabets. The model 
represents concepts as simple programs that best explain observed examples under a 
Bayesian criterion. On a challenging one-shot classification task, the model achieves 
human-level performance while outperforming recent deep learning approaches. We also 
present several “visual Turing tests” probing the model’s creative generalization abilities, 
which in many cases are indistinguishable from human behavior. 



Despite remarkable advances in artificial 
intelligence and machine learning, two 
aspects of human conceptual knowledge 
have eluded machine systems. First, for 
most interesting kinds of natural and man- 
made categories, people can learn a new concept 



from just one or a handful of examples, whereas 
standard algorithms in machine learning require 
tens or hundreds of examples to perform simi- 
larly. For instance, people may only need to see 
one example of a novel two-wheeled vehicle 
(Fig. 1A) in order to grasp the boundaries of the 



new concept, and even children can make mean- 
ingful generalizations via “one-shot learning” 
(1-3). In contrast, many of the leading approaches 
in machine learning are also the most data-hungry,
especially “deep learning” models that have 
achieved new levels of performance on object 
and speech recognition benchmarks (4-9). Sec- 
ond, people learn richer representations than 
machines do, even for simple concepts (Fig. 1B),
using them for a wider range of functions, in- 
cluding (Fig. 1, ii) creating new exemplars (10), 
(Fig. 1, iii) parsing objects into parts and rela- 
tions (11), and (Fig. 1, iv) creating new abstract 
categories of objects based on existing categories 
(12, 13). In contrast, the best machine classifiers 
do not perform these additional functions, which 
are rarely studied and usually require special- 
ized algorithms. A central challenge is to ex- 
plain these two aspects of human-level concept 
learning: How do people learn new concepts 
from just one or a few examples? And how do 
people learn such abstract, rich, and flexible rep- 
resentations? An even greater challenge arises 
when putting them together: How can learning 
succeed from such sparse data yet also produce 
such rich representations? For any theory of 
Fig. 1. People can learn rich concepts from limited data. (A and B) A single example of a new concept (red boxes) can be enough information to support
the (i) classification of new examples, (ii) generation of new examples, (iii) parsing an object into parts and relations (parts segmented by color), and (iv) 
generation of new concepts from related concepts. [Image credit for (A), iv, bottom: With permission from Glenn Roberts and Motorcycle Mojo Magazine] 
learning (4, 14-16), fitting a more complicated
model requires more data, not less, in order to 
achieve some measure of good generalization, 
usually the difference in performance between 
new and old examples. Nonetheless, people seem 
to navigate this trade-off with remarkable agil- 
ity, learning rich concepts that generalize well 
from sparse data. 

This paper introduces the Bayesian program 
learning (BPL) framework, capable of learning 
a large class of visual concepts from just a single 
example and generalizing in ways that are mostly 
indistinguishable from people. Concepts are represented
as simple probabilistic programs, that is,
probabilistic generative models expressed as
structured procedures in an abstract description
language (17, 18). Our framework brings together
three key ideas (compositionality, causality, and
learning to learn) that have been separately influential
in cognitive science and machine learning
over the past several decades (19-22). As pro- 
grams, rich concepts can be built “composition- 
ally” from simpler primitives. Their probabilistic 
semantics handle noise and support creative 
generalizations in a procedural form that (unlike 
other probabilistic models) naturally captures 
the abstract “causal” structure of the real-world 
processes that produce examples of a category. 
Learning proceeds by constructing programs that 
best explain the observations under a Bayesian 
criterion, and the model “learns to learn” (23, 24) 
by developing hierarchical priors that allow pre- 
vious experience with related concepts to ease 
learning of new concepts (25, 26). These priors 
represent a learned inductive bias (27) that ab- 
stracts the key regularities and dimensions of 
variation holding across both types of concepts 
and across instances (or tokens) of a concept in a 
given domain. In short, BPL can construct new 
programs by reusing the pieces of existing ones, 
capturing the causal and compositional proper- 



ties of real-world generative processes operating 
on multiple scales. 

In addition to developing the approach sketched 
above, we directly compared people, BPL, and 
other computational approaches on a set of five 
challenging concept learning tasks (Fig. 1B). The
tasks use simple visual concepts from Omniglot, 
a data set we collected of multiple examples of 
1623 handwritten characters from 50 writing 
systems (Fig. 2) (see acknowledgments). Both im- 
ages and pen strokes were collected (see below) as 
detailed in section S1 of the online supplementary
materials. Handwritten characters are well suited 
for comparing human and machine learning on a 
relatively even footing: They are both cognitively 
natural and often used as a benchmark for com- 
paring learning algorithms. Whereas machine 
learning algorithms are typically evaluated after 
hundreds or thousands of training examples per 
class (5), we evaluated the tasks of classification, 
parsing (Fig. 1B, iii), and generation (Fig. 1B, ii) of
new examples in their most challenging form: after 
just one example of a new concept. We also in- 
vestigated more creative tasks that asked people and 
computational models to generate new concepts 
(Fig. 1B, iv). BPL was compared with three deep
learning models, a classic pattern recognition
algorithm, and various lesioned versions of the
model, a breadth of comparisons that serves to
isolate the role of each modeling ingredient (see 
section S4 for descriptions of alternative models). 
We compare with two varieties of deep convo- 
lutional networks (28), representative of the cur- 
rent leading approaches to object recognition (7), 
and a hierarchical deep (HD) model (29), a prob- 
abilistic model needed for our more generative 
tasks and specialized for one-shot learning. 

Bayesian Program Learning 

The BPL approach learns simple stochastic pro- 
grams to represent concepts, building them com- 



positionally from parts (Fig. 3A, iii), subparts 
(Fig. 3A, ii), and spatial relations (Fig. 3A, iv). 
BPL defines a generative model that can sam- 
ple new types of concepts (an “A,” “B,” etc.) by 
combining parts and subparts in new ways. 
Each new type is also represented as a genera- 
tive model, and this lower-level generative model 
produces new examples (or tokens) of the con- 
cept (Fig. 3A, v), making BPL a generative model 
for generative models. The final step renders 
the token-level variables in the format of the raw 
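data (Fig. 3A, vi). The joint distribution on types
$\psi$, a set of $M$ tokens of that type $\theta^{(1)}, \ldots, \theta^{(M)}$,
and the corresponding binary images $I^{(1)}, \ldots, I^{(M)}$
factors as

$$P(\psi, \theta^{(1)}, \ldots, \theta^{(M)}, I^{(1)}, \ldots, I^{(M)}) = P(\psi) \prod_{m=1}^{M} P(I^{(m)} \mid \theta^{(m)}) \, P(\theta^{(m)} \mid \psi) \qquad (1)$$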

The generative processes for types $P(\psi)$ and
tokens $P(\theta^{(m)} \mid \psi)$ are described by the pseudocode
in Fig. 3B and detailed along with the image
model $P(I^{(m)} \mid \theta^{(m)})$ in section S2. Source code is
available online (see acknowledgments). The 
model learns to learn by fitting each condition- 
al distribution to a background set of characters 
from 30 alphabets, using both the image and the 
stroke data, and this image set was also used to 
pretrain the alternative deep learning models. 
Neither the production data nor any alphabets 
from this set are used in the subsequent evalu- 
ation tasks, which provide the models with only 
raw images of novel characters. 
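
As a purely illustrative sketch of the factorization in Eq. 1 (not the released source code), the joint probability of one type and its M tokens can be accumulated in log space. The function name `log_joint` and all numerical values below are hypothetical.

```python
import math

def log_joint(log_p_type, log_p_tokens, log_p_images):
    """Return log P(psi, theta^(1..M), I^(1..M)) for one type and M tokens.

    log_p_type   : log P(psi)
    log_p_tokens : list of log P(theta^(m) | psi), m = 1..M
    log_p_images : list of log P(I^(m) | theta^(m)), m = 1..M
    """
    if len(log_p_tokens) != len(log_p_images):
        raise ValueError("each token needs exactly one image term")
    # The product over m in Eq. 1 becomes a sum in log space.
    return log_p_type + sum(t + i for t, i in zip(log_p_tokens, log_p_images))

# Made-up probabilities for a type with M = 2 tokens (assumed values).
toy = log_joint(
    math.log(0.01),
    [math.log(0.5), math.log(0.25)],
    [math.log(0.1), math.log(0.2)],
)
print(toy)
```

Working in log space avoids numerical underflow when many small probabilities are multiplied.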

Handwritten character types $\psi$ are an abstract
schema of parts, subparts, and relations. Reflecting
the causal structure of the handwriting process,
character parts $S_i$ are strokes initiated by pressing
the pen down and terminated by lifting it up
(Fig. 3A, iii), and subparts $s_{i1}, \ldots, s_{in_i}$ are more
primitive movements separated by brief pauses of
Fig. 2. Simple visual concepts for comparing human and machine learning. 525 (out of 1623) character concepts, shown with one example each.
the pen (Fig. 3A, ii). To construct a new character
type, first the model samples the number of parts
$\kappa$ and the number of subparts $n_i$ for each part
$i = 1, \ldots, \kappa$, from their empirical distributions as



measured from the background set. Second, a 
template for a part $S_i$ is constructed by sampling
subparts from a set of discrete primitive actions 
learned from the background set (Fig. 3A, i), 



such that the probability of the next action 
depends on the previous. Third, parts are then 
grounded as parameterized curves (splines) by 
sampling the control points and scale parameters 
procedure GenerateType
    κ <- P(κ)                                   > Sample number of parts
    for i = 1 ... κ do
        n_i <- P(n_i | κ)                       > Sample number of sub-parts
        for j = 1 ... n_i do
            s_ij <- P(s_ij | s_i(j-1))          > Sample sub-part sequence
        end for
        R_i <- P(R_i | S_1, ..., S_(i-1))       > Sample relation
    end for
    ψ <- {κ, R, S}
    return @GenerateToken(ψ)                    > Return program

procedure GenerateToken(ψ)
    for i = 1 ... κ do
        S_i^(m) <- P(S_i^(m) | S_i)             > Add motor variance
        L_i^(m) <- P(L_i^(m) | R_i, T_1^(m), ..., T_(i-1)^(m))   > Sample part’s start location
        T_i^(m) <- f(L_i^(m), S_i^(m))          > Compose a part’s trajectory
    end for
    A^(m) <- P(A^(m))                           > Sample affine transform
    I^(m) <- P(I^(m) | T^(m), A^(m))            > Sample image
    return I^(m)




Fig. 3. A generative model of handwritten characters. (A) New types are generated by choosing primitive actions (color coded) from a library (i), 
combining these subparts (ii) to make parts (iii), and combining parts with relations to define simple programs (iv). New tokens are generated by running 
these programs (v), which are then rendered as raw data (vi). (B) Pseudocode for generating new types $\psi$ and new token images $I^{(m)}$ for $m = 1, \ldots, M$. The
function $f(\cdot, \cdot)$ transforms a subpart sequence and start location into a trajectory.



[Figure panel title: Training item with model’s five best parses]




Fig. 4. Inferring motor programs from images. Parts are distinguished 
by color, with a colored dot indicating the beginning of a stroke and an 
arrowhead indicating the end. (A) The top row shows the five best pro- 
grams discovered for an image along with their log-probability scores 
(Eq. 1). Subpart breaks are shown as black dots. For classification, each 
program was refit to three new test images (left in image triplets), and 
the best-fitting parse (top right) is shown with its image reconstruction 
(bottom right) and classification score (log posterior predictive probability). 
The correctly matching test item receives a much higher classification 
score and is also more cleanly reconstructed by the best programs induced 
from the training item. (B) Nine human drawings of three characters 
(left) are shown with their ground truth parses (middle) and best model 
parses (right). 
for each subpart. Last, parts are roughly positioned
to begin either independently, at the beginning, at
the end, or along previous parts, as defined by
relation $R_i$ (Fig. 3A, iv).
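
The type-level sampling steps described above can be sketched with a toy sampler. This is a minimal illustration under stated assumptions, not the authors' implementation: the primitive names, every probability table, and the uniform choice of relation are hypothetical stand-ins for distributions that BPL fits to the background set, and the grounding of parts as splines (control points and scales) is omitted.

```python
import random

# Hypothetical toy distributions (assumed values, not learned from data).
P_NUM_PARTS = {1: 0.5, 2: 0.3, 3: 0.2}        # stands in for P(kappa)
P_NUM_SUBPARTS = {1: 0.6, 2: 0.3, 3: 0.1}     # stands in for P(n_i | kappa)
PRIMITIVES = ["arc", "line", "hook"]          # stand-in primitive library
P_START = {"arc": 0.4, "line": 0.4, "hook": 0.2}
P_NEXT = {  # first-order transitions: next action depends on the previous one
    "arc":  {"arc": 0.2, "line": 0.6, "hook": 0.2},
    "line": {"arc": 0.5, "line": 0.2, "hook": 0.3},
    "hook": {"arc": 0.4, "line": 0.5, "hook": 0.1},
}
RELATIONS = ["independent", "start", "end", "along"]

def sample(dist, rng):
    """Draw one key from a {value: probability} table."""
    return rng.choices(list(dist.keys()), weights=list(dist.values()), k=1)[0]

def generate_type(rng):
    """Sample a toy character type: parts, subpart sequences, and relations."""
    kappa = sample(P_NUM_PARTS, rng)
    parts, relations = [], []
    for i in range(kappa):
        n_i = sample(P_NUM_SUBPARTS, rng)
        subparts = [sample(P_START, rng)]
        for _ in range(n_i - 1):
            subparts.append(sample(P_NEXT[subparts[-1]], rng))
        parts.append(subparts)
        # The first part has no previous part to attach to.
        relations.append("independent" if i == 0 else rng.choice(RELATIONS))
    return {"kappa": kappa, "parts": parts, "relations": relations}

print(generate_type(random.Random(0)))
```

Passing an explicit `random.Random` instance keeps the sketch reproducible without touching global random state.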

Character tokens $\theta^{(m)}$ are produced by executing
the parts and the relations and modeling how
ink flows from the pen to the page. First, motor
noise is added to the control points and the scale
of the subparts to create token-level stroke trajectories
$S^{(m)}$. Second, the trajectory’s precise start
location $L^{(m)}$ is sampled from the schematic provided
by its relation to previous strokes. Third,
global transformations are sampled, including
an affine warp $A^{(m)}$ and adaptive noise parameters
that ease probabilistic inference (30). Last, a
binary image $I^{(m)}$ is created by a stochastic rendering
function, lining the stroke trajectories
with grayscale ink and interpreting the pixel 
values as independent Bernoulli probabilities. 
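
The last step, treating grayscale ink values as independent Bernoulli probabilities, can be illustrated as follows. This is a sketch only: the function names, the clipping constant `eps`, and the tiny example grid are assumptions, and the actual rendering function is specified in section S2.

```python
import math
import random

def render_binary(ink_prob, rng):
    """Sample a binary image; each pixel is an independent Bernoulli draw."""
    return [[1 if rng.random() < p else 0 for p in row] for row in ink_prob]

def log_likelihood(image, ink_prob, eps=1e-9):
    """Log probability of a binary image under per-pixel Bernoulli ink values."""
    total = 0.0
    for img_row, prob_row in zip(image, ink_prob):
        for pixel, p in zip(img_row, prob_row):
            p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
            total += math.log(p) if pixel else math.log(1.0 - p)
    return total

# Hypothetical 2 x 3 grid of grayscale ink values in [0, 1].
ink = [[0.9, 0.5, 0.1],
       [0.0, 1.0, 0.5]]
sampled = render_binary(ink, random.Random(1))
print(sampled, log_likelihood(sampled, ink))
```

The same per-pixel likelihood is what makes it possible to score how well a candidate program explains an observed image.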

Posterior inference requires searching the large 
combinatorial space of programs that could have 
generated a raw image $I^{(m)}$. Our strategy uses fast
bottom-up methods (31) to propose a range of 
candidate parses. The most promising candidates 
are refined by using continuous optimization 
and local search, forming a discrete approximation
to the posterior distribution $P(\psi, \theta^{(m)} \mid I^{(m)})$
(section S3). Figure 4A shows the set of discovered
programs for a training image $I^{(1)}$ and
how they are refit to different test images $I^{(2)}$ to
compute a classification score $\log P(I^{(2)} \mid I^{(1)})$ (the
log posterior predictive probability), where higher 
scores indicate that they are more likely to be- 
long to the same class. A high score is achieved 
when at least one set of parts and relations can 
successfully explain both the training and the 
test images, without violating the soft constraints 
of the learned within-class variability model. 
Figure 4B compares the model’s best-scoring 
parses with the ground-truth human parses for 
several characters. 
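
A simplified sketch of how a discrete set of parses could yield a classification score is given below. It assumes each of the K discovered parses carries an unnormalized log weight and a log score for how well it refits the test image; the exact computation, including the refitting of token-level variables, is described in section S3 and is not reproduced here. All function names and numbers are hypothetical.

```python
import math

def logsumexp(values):
    """Numerically stable log of a sum of exponentials."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def classification_score(log_weights, log_refit_scores):
    """Approximate log P(I2 | I1) as a weighted mixture over K parses.

    log_weights      : unnormalized log posterior weights of the parses of I1
    log_refit_scores : log probability of the test image I2 under each parse
    """
    log_z = logsumexp(log_weights)  # normalize the discrete posterior
    return logsumexp([w - log_z + s for w, s in zip(log_weights, log_refit_scores)])

def one_shot_classify(scores_by_label):
    """Pick the training character whose parses best explain the test image."""
    return max(scores_by_label, key=scores_by_label.get)

# Two equally weighted parses explaining the test image with made-up probabilities.
score = classification_score([0.0, 0.0], [math.log(0.2), math.log(0.4)])
print(score, one_shot_classify({"char_a": -10.0, "char_b": -3.0}))
```

Because the mixture is dominated by its largest term, a single parse that explains both images well is enough to produce a high score, matching the description above.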

Results 

People, BPL, and alternative models were com- 
pared side by side on five concept learning tasks 
that examine different forms of generalization 
from just one or a few examples (example task 
Fig. 5). All behavioral experiments were run 
through Amazon’s Mechanical Turk, and the ex- 
perimental procedures are detailed in section S5. 
The main results are summarized by Fig. 6, and
additional lesion analyses and controls are re- 
ported in section S6. 

One-shot classification was evaluated through 
a series of within-alphabet classification tasks for 
10 different alphabets. As illustrated in Fig. 1B, i,
a single image of a new character was presented, 
and participants selected another example of that 
same character from a set of 20 distinct char- 
acters produced by a typical drawer of that alpha- 
bet. Performance is shown in Fig. 6A, where chance 
is 95% errors. As a baseline, the modified Hausdorff 
distance (32) was computed between centered 
images, producing 38.8% errors. People were 
skilled one-shot learners, achieving an average 
error rate of 4.5% (N = 40). BPL showed a similar 
error rate of 3.3%, achieving better performance 
than a deep convolutional network (convnet; 13.5% 
errors) and the HD model (34.8%), each adapted
from deep learning methods that have performed 
well on a range of computer vision tasks. A deep 
Siamese convolutional network optimized for this 
one-shot learning task achieved 8.0% errors (33), 
still about twice as high as humans or our model. 
BPL’s advantage points to the benefits of modeling 
the underlying causal process in learning concepts, 
a strategy different from the particular deep learn- 
ing approaches examined here. BPL’s other key 
ingredients also make positive contributions, as 
shown by higher error rates for BPL lesions 
without learning to learn (token-level only) or 
compositionality (11.0% errors and 14.0%, respec- 
tively). Learning to learn was studied separately 
at the type and token level by disrupting the 
learned hyperparameters of the generative model. 
Compositionality was evaluated by comparing 
BPL to a matched model that allowed just one 
spline-based stroke, resembling earlier analysis- 
by-synthesis models for handwritten characters 
that were similarly limited (34, 35). 
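
For readers unfamiliar with the baseline, the following sketch shows one-shot classification by modified Hausdorff distance between centered sets of ink-pixel coordinates, assuming the commonly used definition (the larger of the two directed mean nearest-neighbor distances). The function names and toy point sets are hypothetical. The sketch also checks the stated chance level for a 20-way task.

```python
import math

def center(points):
    """Translate a point set so that its centroid sits at the origin."""
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    return [(x - cx, y - cy) for x, y in points]

def directed_distance(a, b):
    """Mean distance from each point in a to its nearest neighbor in b."""
    return sum(min(math.dist(p, q) for q in b) for p in a) / len(a)

def modified_hausdorff(a, b):
    """Modified Hausdorff distance between two centered point sets."""
    a, b = center(a), center(b)
    return max(directed_distance(a, b), directed_distance(b, a))

def classify(test_points, training_sets):
    """Return the label of the single training example nearest to the test item."""
    return min(training_sets,
               key=lambda label: modified_hausdorff(test_points, training_sets[label]))

# Toy ink coordinates (assumed): one short and one long horizontal stroke.
training = {"short": [(0, 0), (2, 0)], "long": [(0, 0), (10, 0)]}
print(classify([(1, 1), (3, 1)], training))

# With 20 candidate characters, guessing is correct 1 time in 20.
chance_error = 1 - 1 / 20
print(chance_error)
```

Centering removes sensitivity to where the character sits in the image, but the distance still has no notion of strokes or parts, which is the contrast the comparison with BPL draws out.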

The human capacity for one-shot learning is 
more than just classification. It can include a suite 
of abilities, such as generating new examples of a 
concept. We compared the creative outputs produced
by humans and machines through “visual
Turing tests,” where naive human judges tried to
identify the machine, given paired examples of 
human and machine behavior. In our most basic 
task, judges compared the drawings from nine 
humans asked to produce a new instance of a 
concept given one example with nine new ex- 
amples drawn by BPL (Fig. 5). We evaluated each 
model based on the accuracy of the judges, which 
we call their identification (ID) level: Ideal model 
performance is 50% ID level, indicating that they 
cannot distinguish the model’s behavior from 
humans; worst-case performance is 100%. Each 
judge (N = 147) completed 49 trials with blocked 
feedback, and judges were analyzed individually 
and in aggregate. The results are shown in Fig. 
6B (new exemplars). Judges had only a 52% ID 
level on average for discriminating human versus 
BPL behavior. As a group, this performance was 
barely better than chance [t(47) = 2.03, P = 0.048],
and only 3 of 48 judges had an ID level reliably 
above chance. Three lesioned models were eval- 
uated by different groups of judges in separate 
Fig. 5. Generating new exemplars. Humans and machines were given an image of a novel character
(top) and asked to produce new exemplars. The nine-character grids in each pair that were generated by 
a machine are (by row) 1, 2; 2, 1; 1, 1. 
conditions of the visual Turing test, examining
the necessity of key model ingredients in BPL. 
Two lesions, learning to learn (token-level only) 
and compositionality, resulted in significantly 
easier Turing test tasks (80% ID level with 17 of 19 
judges above chance and 65% with 14 of 26, re- 
spectively), indicating that this task is a nontrivial 
one to pass and that these two principles each 
contribute to BPL’s human-like generative profi- 
ciency. To evaluate parsing more directly (Fig. 4B), 
we ran a dynamic version of this task with a dif- 
ferent set of judges (N = 143), where each trial 
showed paired movies of a person and BPL draw- 
ing the same character. BPL performance on this 
visual Turing test was not perfect (59% average 
ID level; new exemplars (dynamic) in Fig. 6B), 
although randomizing the learned prior on stroke 
order and direction significantly raises the ID level 
(71%), showing the importance of capturing the 
right causal dynamics for BPL. 
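
The group-level comparisons against chance reported for the visual Turing tests are one-sample t tests on the judges' ID levels. A minimal sketch of that statistic follows; the per-judge values are invented for illustration, and the P value (which needs the t distribution's cumulative distribution function) is deliberately not computed here.

```python
import math
import statistics

def t_against_chance(id_levels, chance=0.5):
    """One-sample t statistic and degrees of freedom for ID levels versus chance."""
    n = len(id_levels)
    mean = statistics.mean(id_levels)
    sd = statistics.stdev(id_levels)        # sample standard deviation (n - 1)
    t = (mean - chance) / (sd / math.sqrt(n))
    return t, n - 1

# Hypothetical per-judge accuracies (fraction of trials identifying the machine).
judges = [0.45, 0.55, 0.49, 0.53, 0.51, 0.47, 0.56, 0.52]
t_stat, dof = t_against_chance(judges)
print(f"t({dof}) = {t_stat:.2f}")
```

With 48 judges, as in the first exemplar-generation test, this gives 47 degrees of freedom, consistent with the reported t(47).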

Although learning to learn new characters from 
30 background alphabets proved effective, many 
human learners will have much less experience: 
perhaps familiarity with only one or a few alpha- 
bets, along with related drawing tasks. To see 
how the models perform with more limited expe- 
rience, we retrained several of them by using two 
different subsets of only five background alpha- 
bets. BPL achieved similar performance for one- 
shot classification as with 30 alphabets (4.3% 
and 4.0% errors, for the two sets, respectively); in 
contrast, the deep convolutional net performed 
notably worse than before (24.0% and 22.3% 
errors). BPL performance on a visual Turing test 
of exemplar generation (N = 59) was also similar 
on the first set [52% average ID level that was 
not significantly different from chance t(26) = 1.04,
P > 0.05], with only 3 of 27 judges reliably above 
chance, although performance on the second set 
was slightly worse [57% ID level; t(31) = 4.35, P <
0.001; 7 of 32 judges reliably above chance]. 
These results suggest that although learning to 
learn is important for BPL’s success, the model’s 
structure allows it to take nearly full advantage 
of comparatively limited background training. 

The human productive capacity goes beyond 
generating new examples of a given concept: 
People can also generate whole new concepts. 
We tested this by showing a few example char- 
acters from 1 of 10 foreign alphabets and asking 
participants to quickly create a new character 
that appears to belong to the same alphabet (Fig. 
7A). The BPL model can capture this behavior by 
placing a nonparametric prior on the type level, 
which favors reusing strokes inferred from the 
example characters to produce stylistically con- 
sistent new characters (section S7). Human judges 
compared people versus BPL in a visual Turing 
test (N = 117), viewing a series of displays in the 
format of Fig. 7A, i and iii. The judges had only a
49% ID level on average [Fig. 6B, new concepts 
(from type)], which is not significantly different 
from chance [t(34) = 0.45, P > 0.05]. Individually,
only 8 of 35 judges had an ID level significantly 
above chance. In contrast, a model with a lesion 
to (type-level) learning to learn was successfully 
detected by judges on 69% of trials in a separate 



condition of the visual Turing test, and was sig- 
nificantly easier to detect than BPL (18 of 25 judges 
above chance). Further comparisons in section 
S6 suggested that the model’s ability to produce 
plausible novel characters, rather than stylistic 
consistency per se, was the crucial factor for 
passing this test. We also found greater variation 
in individual judges’ comparisons of people and 
the BPL model on this task, as reflected in their 
ID levels: 10 of 35 judges had individual ID levels 
significantly below chance; in contrast, only two 
participants had below-chance ID levels for BPL 
across all the other experiments shown in Fig. 6B. 

Last, judges (N = 124) compared people and 
models on an entirely free-form task of generat- 
ing novel character concepts, unconstrained by a 
particular alphabet (Fig. 7B). Sampling from the 
prior distribution on character types $P(\psi)$ in BPL
led to an average ID level of 57% correct in a visual 
Turing test (11 of 32 judges above chance); with 
the nonparametric prior that reuses inferred parts 
from background characters, BPL achieved a 51% 
ID level [Fig. 7B and new concepts (unconstrained) 
in Fig. 6B; ID level not significantly different 
from chance t(24) = 0.497, P > 0.05; 2 of 25 judges
above chance]. A lesion analysis revealed that both 
compositionality (68% and 15 of 22) and learning 
to learn (64% and 22 of 45) were crucial in passing 
this test. 

Discussion 

Despite a changing artificial intelligence land- 
scape, people remain far better than machines at 



learning new concepts: They require fewer exam- 
ples and use their concepts in richer ways. Our 
work suggests that the principles of composi- 
tionality, causality, and learning to learn will be 
critical in building machines that narrow this gap. 
Machine learning and computer vision research- 
ers are beginning to explore methods based on 
simple program induction (36-41), and our results 
show that this approach can perform one-shot 
learning in classification tasks at human-level ac- 
curacy and fool most judges in visual Turing tests 
of its more creative abilities. For each visual Turing 
test, fewer than 25% of judges performed signif- 
icantly better than chance. 

Although successful on these tasks, BPL still 
sees less structure in visual concepts than people 
do. It lacks explicit knowledge of parallel lines, 
symmetry, optional elements such as cross bars 
in “7”s, and connections between the ends of 
strokes and other strokes. Moreover, people 
use their concepts for other abilities that were 
not studied here, including planning (42), expla- 
nation (43), communication (44), and conceptual 
combination (45). Probabilistic programs could 
capture these richer aspects of concept learning 
and use, but only with more abstract and complex 
structure than the programs studied here. More
sophisticated programs could also be suitable for
learning compositional, causal representations of 
many concepts beyond simple perceptual categories. 
Examples include concepts for physical artifacts, 
such as tools, vehicles, or furniture, that are well 
described by parts, relations, and the functions 
Fig. 6. Human and machine performance was compared on (A) one-shot classification and (B)
four generative tasks. The creative outputs for humans and models were compared by the percent of 
human judges to correctly identify the machine. Ideal performance is 50%, where the machine is 
perfectly confusable with humans in these two-alternative forced choice tasks (pink dotted line). Bars 
show the mean ± SEM [N = 10 alphabets in (A)]. The no learning-to-learn lesion is applied at different
levels (bars left to right): (A) token; (B) token, stroke order, type, and type. 
these structures support; fractal structures, such
as rivers and trees, where complexity arises from 
highly iterated but simple generative processes; 
and even abstract knowledge, such as natural 
number, natural language semantics, and intuitive 
physical theories (17, 46-48). 

Capturing how people learn all these concepts 
at the level we reached with handwritten charac- 
ters is a long-term goal. In the near term, applying 
our approach to other types of symbolic concepts 
may be particularly promising. Human cultures 
produce many such symbol systems, including 
gestures, dance moves, and the words of spoken 
and signed languages. As with characters, these 
concepts can be learned to some extent from 
one or a few examples, even before the symbolic 
meaning is clear: Consider seeing a “thumbs up,” 



“fist bump,” or “high five” or hearing the names 
“Boutros Boutros-Ghali,” “Kofi Annan,” or “Ban 
Ki-moon” for the first time. From this limited 
experience, people can typically recognize new 
examples and even produce a recognizable sem- 
blance of the concept themselves. The BPL prin- 
ciples of compositionality, causality, and learning 
to learn may help to explain how. 

To illustrate how BPL applies in the domain of 
speech, programs for spoken words could be 
constructed by composing phonemes (subparts) 
systematically to form syllables (parts), which 
compose further to form morphemes and entire 
words. Given an abstract syllable-phoneme parse 
of a word, realistic speech tokens can be gener- 
ated from a causal model that captures aspects of 
speech motor articulation. These part and subpart 



components are shared across the words in a 
language, enabling children to acquire them 
through a long-term learning-to-learn process. 
We have already found that a prototype model 
exploiting compositionality and learning to learn, 
but not causality, is able to capture some aspects 
of the human ability to learn and generalize new 
spoken words (e.g., English speakers learning 
words in Japanese) (49). Further progress may 
come from adopting a richer causal model of 
speech generation, in the spirit of classic “analysis- 
by-synthesis” proposals for speech perception and 
language comprehension (20, 50). 

Although our work focused on adult learners, 
it raises natural developmental questions. If chil- 
dren learning to write acquire an inductive bias 
similar to what BPL constructs, the model could 
help explain why children find some characters 
difficult and which teaching procedures are most 
effective (51). Comparing children’s parsing and 
generalization behavior at different stages of 
learning and BPL models given varying back- 
ground experience could better evaluate the model’s 
learning-to-learn mechanisms and suggest improvements.
By testing our classification tasks on infants
who categorize visually before they begin drawing 
or scribbling (52), we can ask whether children learn 
to perceive characters more causally and composi- 
tionally based on their own proto-writing experi- 
ence. Causal representations are prewired in our 
current BPL models, but they could conceivably 
be constructed through learning to learn at an 
even deeper level of model hierarchy (53). 

Last, we hope that our work may shed light on 
the neural representations of concepts and the 
development of more neurally grounded learning 
models. Supplementing feedforward visual pro- 
cessing (54), previous behavioral studies and our 
results suggest that people learn new handwritten 
characters in part by inferring abstract motor 
programs (55), a representation grounded in pro- 
duction yet active in purely perceptual tasks, 
independent of specific motor articulators and 
potentially driven by activity in premotor cortex 
(56-58). Could we decode representations struc- 
turally similar to those in BPL from brain imaging 
of premotor cortex (or other action-oriented regions) 
in humans perceiving and classifying new char- 
acters for the first time? Recent large-scale brain 
models (59) and deep recurrent neural networks 
(60-62) have also focused on character recognition
and production tasks, but typically learning
from large training samples with many examples 
of each concept. We see the one-shot learning 
capacities studied here as a challenge for these 
neural models: one we expect they might rise to 
by incorporating the principles of composition- 
ality, causality, and learning to learn that BPL 
instantiates. 
Fig. 7. Generating new concepts. (A) Humans and machines were given a novel alphabet (i) and asked
to produce new characters for that alphabet. New machine-generated characters are shown in (ii). 
Human and machine productions can be compared in (iii). The four-character grids in each pair that 
were generated by the machine are (by row) 1, 1, 2; 1, 2. (B) Humans and machines produced new 
characters without a reference alphabet. The grids that were generated by a machine are 2; 1; 1; 2. 
The central concept of the transition state in
chemical kinetics is familiar to all students
of chemistry. Since its inception by Arrhenius
(1) and later development into a full
theory by Eyring, Wigner, Polanyi, and Evans
(2-5), the idea that the thermal rate depends
primarily on the highest point along the lowest-energy
path from reactants to products has remained
essentially unchanged. Most of chemical
dynamics is now firmly based on this idea of the 
transition state, notwithstanding the emergence 
of unconventional reactions such as roaming (6, 7), 
where a photodissociated atom wanders before 
abstracting from the parent fragment. Despite the 
clear importance of the transition state to the field 
of chemistry, direct experimental studies of the 
transition state and its properties are scarce (8). 

Here, we report the observation of a vibrational 
pattern, a dip in the trend of quantum level spac- 
ings, which occurs at the energy of the saddle 
point. This phenomenon is expected to provide a 
generally applicable and accurate method for 
characterizing transition states. Only a subset of 
vibrational states exhibit a dip; these states contain 
excitation along the reaction coordinate and are 
barrier-proximal, meaning that they are more 
susceptible than other states to the effects of
the isomerization barrier. Experimental evidence
for this concept is drawn from our studies of
two prototypical systems: the HCN ↔ HNC
isomerization and the cis-trans conformational 
change in the first electronically excited singlet 
state of acetylene. 

Effective frequency and the 
isomerization dip 

The effective frequency $\omega^{\mathrm{eff}}$ is the central quantity
in our model for the spectroscopic signature
of isomerizing systems. In a one-dimensional
system, the effective frequency is the derivative of
the energy with respect to the quantum number $n$,

$$\omega^{\mathrm{eff}}(n) = \frac{dE}{dn} = \frac{\Delta E}{\Delta n} \qquad (1)$$

where $\omega^{\mathrm{eff}}$ is evaluated discretely for quantized
systems. $\omega^{\mathrm{eff}}$ is a dynamic quantity that can change
as excitation increases, unlike quantities such as
harmonic frequency, $\omega$, or fundamental frequency,
$\nu$, which are often listed as molecular constants.
As such, it is a useful diagnostic of the behavior of 
the system. 
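
As a small illustration of the discrete evaluation, the effective frequency is simply the spacing between adjacent energy levels. The sketch below contrasts a harmonic oscillator with the standard anharmonic (Morse-type) level formula E(n) = ω(n + 1/2) − ωx(n + 1/2)²; the function names and the parameter values (arbitrary energy units) are assumptions chosen for illustration.

```python
def effective_frequency(levels):
    """Discrete effective frequency: spacing between adjacent energy levels."""
    return [upper - lower for lower, upper in zip(levels, levels[1:])]

def harmonic_levels(omega, n_max):
    """E(n) = omega * (n + 1/2) for n = 0..n_max."""
    return [omega * (n + 0.5) for n in range(n_max + 1)]

def morse_levels(omega, omega_x, n_max):
    """E(n) = omega * (n + 1/2) - omega_x * (n + 1/2)**2 for n = 0..n_max."""
    return [omega * (n + 0.5) - omega_x * (n + 0.5) ** 2 for n in range(n_max + 1)]

# Made-up constants: omega = 100, anharmonicity omega_x = 1.
print(effective_frequency(harmonic_levels(100.0, 3)))    # constant spacing
print(effective_frequency(morse_levels(100.0, 1.0, 3)))  # spacing falls linearly
```

The harmonic spacing stays constant, whereas the anharmonic spacing decreases steadily with excitation, which is the kind of trend the effective frequency is meant to diagnose.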



Applications of effective frequency date back a 
long way. For example, the effective frequencies 
$\omega^{\mathrm{eff}}(n)$ of a state of a diatomic molecule are its
vibrational intervals, which decrease to zero at 
the dissociation limit. The sum of the effective 
frequencies is therefore the dissociation energy. 
In most cases it is not possible to observe $\omega^{\mathrm{eff}}(n)$
all the way to the dissociation limit, but a linear
extrapolation to $\omega^{\mathrm{eff}} = 0$ allows a very good estimate
of the dissociation energy, notwithstanding
nonlinearities in the trend of vibrational intervals 
near dissociation. This is the basis of the Birge- 
Sponer plot (9) where the area under a graph of 
the vibrational intervals, co eff (7z), against n gives 
the dissociation energy. Leroy and Bernstein 
(10) have given a protocol for extrapolating the 
effective frequencies, which takes account of the 
exact long-range shape of the vibrational potential 
near dissociation. This procedure is found to give 
very accurate dissociation energies (11). 
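The linear Birge-Sponer extrapolation described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `birge_sponer_d0` is hypothetical, the input levels are synthetic Morse-type term values rather than data from this work, and the intervals are plotted against the half-integer midpoint n + 1/2 so that the area under the fitted line estimates the dissociation energy measured from the lowest level.

```python
import numpy as np

def birge_sponer_d0(levels):
    """Estimate the dissociation energy (relative to the lowest level)
    from a linear Birge-Sponer extrapolation of the level spacings."""
    levels = np.asarray(levels, dtype=float)
    intervals = np.diff(levels)                 # effective frequencies
    u = np.arange(len(intervals)) + 0.5         # midpoint quantum numbers n + 1/2
    slope, intercept = np.polyfit(u, intervals, 1)
    if slope >= 0:
        raise ValueError("intervals do not decrease; no extrapolation possible")
    u_zero = -intercept / slope                 # where the fitted line reaches zero
    return 0.5 * intercept * u_zero             # area of the triangle under the line

# Synthetic Morse-type levels (assumed constants, same units throughout)
omega, x = 2000.0, -20.0
n = np.arange(6)
levels = omega * (n + 0.5) + x * (n + 0.5) ** 2
d0 = birge_sponer_d0(levels)
```

For a Morse oscillator the intervals are exactly linear, so this extrapolation reproduces the analytic value; for real molecules the curvature near dissociation is what the LeRoy-Bernstein treatment is meant to handle.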

Effective frequencies also play a large part in 
our understanding of quasi-linear molecules. A 
quasi-linear molecule has a nonlinear equilibrium 
geometry but a comparatively small potential barrier to linearity. The pattern of the lowest vibrational levels is that of a bent molecule, but with increasing bending vibrational excitation, this changes smoothly into the pattern for a linear molecule, vibrating with large amplitude. Dixon
(12) modeled a quasi-linear potential as a two- 
dimensional harmonic oscillator perturbed by a 
Gaussian hump at the linear configuration, and 
calculated its energy levels. These levels may be 
assigned vibrational (v) and angular momentum 
(K) quantum numbers (13). If the vibrational 
intervals (effective frequencies) for a given K value 
are plotted against v, they pass through a mini- 
mum at the energy of the potential barrier, 
thereby allowing determination of its value. The 
depth of this “Dixon dip” is greatest for K = 0 
and decreases with increasing K. The reason is
that the angular momentum results in a K-dependent
centrifugal barrier at the linear configuration,
which the molecule must avoid.

Fig. 1. Effective frequency plots below their associated model potentials. (A) Harmonic oscillator. (B) Morse oscillator. (C) Symmetric double minimum potential. (D) Asymmetric double minimum potential. In the top row, the quantized energy levels are marked with dashed lines. In the bottom row, the classical ω^eff is shown as a solid line, with the quantum level spacings plotted as open circles versus E (the midpoint energy for each interval). In (C), the upper and lower series of circles correspond to the vibrational level spacings and tunneling splittings, respectively. In (D), the ω^eff curve and energy levels for the second minimum are shown in red, and the quantum level spacings are overlaid on the ω^eff curves as triangles and squares.

Fig. 2. Relationship between potential shape and ω^eff. (A) Effective frequency curves. (B) The corresponding potentials as a function of the shape parameter m in Eq. 4.

We now illustrate the concept of effective frequency in more detail, with the four types of potential shown in Fig. 1. For the harmonic oscillator,

$$\omega^{\mathrm{eff}}(n) = \frac{\partial}{\partial n}\left[\omega\left(n + \tfrac{1}{2}\right)\right] = \omega, \qquad \frac{\partial \omega^{\mathrm{eff}}}{\partial n} = 0,$$

indicating that the dynamics of the system do not change as a function of energy. For a Morse oscillator, the potential is V(r) = D_e[1 − exp(−ar)]², where D_e is the dissociation energy, a is a length parameter, and r is the bond length displacement. In this case,



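$$\omega^{\mathrm{eff}}(n) = \frac{\partial}{\partial n}\left[\omega\left(n + \tfrac{1}{2}\right) + x\left(n + \tfrac{1}{2}\right)^{2}\right] = \omega + x + 2nx \qquad (3a)$$

and

$$\frac{\partial \omega^{\mathrm{eff}}}{\partial n} = 2x \qquad (3b)$$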



where x is always negative. This linear decrease of ω^eff with n reflects the migration of the Morse wave functions toward the softer outer turning point. When ω^eff reaches zero at the dissociation limit, it becomes clear that the Morse and harmonic oscillators display very different dynamics.

Simple expressions for ω^eff and ∂ω^eff/∂n cannot
be derived for the other cases in Fig. 1, but
these illustrate the most important feature even 
more clearly: The effective frequency goes to zero 
at the energy of each stationary point on the 
potential. Classically, this can be understood by 
imagining a ball released to roll on a double- 
minimum surface. If the ball starts on one side at 
exactly the height of a local maximum, it will 
reach that maximum with zero kinetic energy 
and stop. Because the ball never returns, the oscil- 
lation period is infinite and the frequency is 
therefore zero. We see immediately that this applies to the Morse oscillator as well: ω^eff reaches zero at the dissociation limit, which is a horizontal asymptote of V(r). It is clear that this phenomenon is quite general and that zeros or abrupt changes in ω^eff signal important changes in the dynamics of the system.
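The classical argument can be made concrete with a system whose period is known in closed form. The sketch below swaps in a plane pendulum, which is not one of the potentials in Fig. 1 but has the same ingredient: a potential maximum at which the period diverges. It assumes the standard result that the libration period is 4K(k)/ω₀ with k² = E/E_barrier, and evaluates the complete elliptic integral K through the arithmetic-geometric mean; the function names are hypothetical.

```python
import math

def agm(a, b, max_iter=64):
    """Arithmetic-geometric mean of two non-negative numbers."""
    for _ in range(max_iter):
        if abs(a - b) <= 1e-15 * a:
            break
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return a

def pendulum_omega_eff(energy, barrier=2.0, omega0=1.0):
    """Classical oscillation frequency of a pendulum at a given energy.

    Energies are measured from the potential minimum; `barrier` is the
    energy of the potential maximum and `omega0` the small-amplitude
    (harmonic) frequency.
    """
    if not 0.0 <= energy <= barrier:
        raise ValueError("energy must lie between the minimum and the barrier top")
    if energy == barrier:
        return 0.0  # infinite period exactly at the stationary point
    # omega = pi*omega0 / (2 K(k)) and K(k) = pi / (2 agm(1, sqrt(1 - k^2)))
    return omega0 * agm(1.0, math.sqrt(1.0 - energy / barrier))

# Frequency falls slowly at first and then collapses close to the barrier
curve = [(e, pendulum_omega_eff(e)) for e in (0.0, 0.5, 1.0, 1.5, 1.9, 1.999)]
```

The collapse of the frequency is logarithmic in the distance from the barrier energy, which is why the dip is sharp and confined to states close to the stationary point.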

In their quantum and semiclassical analysis of highly excited states of HCP, Jacobson and Child (14) mentioned a dip in ω^eff as the signature of an approach to a saddle point. Because the HCP ↔ HPC potential energy surface exhibits some unusual features (HPC is a saddle point rather than a second minimum) and is not a true isomerization (15), the observed ω^eff trend was categorized as a peculiar “Dixon dip” rather than being recognized as the universal signature proposed here. Similarly, the onset of internal rotation in the ground state of SiC₂ (16) is not an isomerization, although the ideas presented here are applicable to it. More generally, the behavior of systems as they encounter stationary points has been investigated from other perspectives as well (17-20). For our purposes, it suffices that this dip in ω^eff provides a marker of the chemically relevant transition state energy, as we demonstrate below.

Fig. 3. Salient features of the cis-trans isomerization in S₁ C₂H₂. The barrier height and energy difference between the two conformers are shown, as well as the combination of trans normal modes (q₃ and q₆) that corresponds to the isomerization coordinate.

A model for measurement of the 
transition state energy 

To determine the transition state energy, we propose the following semiempirical formula for ω^eff as a function of energy, E, defined as the midpoint energy for each vibrational interval:

$$\omega^{\mathrm{eff}}(E) = \omega_0\left(1 - \frac{E}{E_{\mathrm{TS}}}\right)^{1/m}, \qquad \omega_0,\ E_{\mathrm{TS}} \ge 0;\quad 2 \le m \le \infty;\quad E \le E_{\mathrm{TS}} \qquad (4)$$

where ω₀ is the effective frequency at E = 0 for the progression being analyzed, E_TS is the energy of the transition state, and m is a parameter related to the barrier shape. For the Morse oscillator (Fig. 1B), m = 2 analytically (21-23), E_TS = D_e, and ω₀ = ω, the harmonic frequency. Equation 4 can be regarded as a generalization of the Morse formula where m is allowed to take values greater than 2. The formula also satisfies the required physical boundary conditions of a limiting harmonic frequency ω₀ at E = 0 and ω^eff = 0 at E = E_TS.
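One way to fit the formula above to a set of (E, ω^eff) points is sketched here. The text does not specify a fitting procedure, so everything in this block is an assumption: the function name `fit_eq4`, the use of a logarithmic linearization, and the simple one-dimensional scan over trial E_TS values. Only points below the dip (E < E_TS, ω^eff > 0) are used, consistent with the stated range of validity.

```python
import numpy as np

def fit_eq4(E, w_eff, n_grid=4001, span=5.0):
    """Fit w_eff = w0 * (1 - E/E_TS)**(1/m) by scanning trial E_TS values.

    For each trial E_TS the model is linear in log space:
        log(w_eff) = log(w0) + (1/m) * log(1 - E/E_TS)
    so w0 and m follow from a straight-line fit. The trial with the
    smallest squared residual is returned as (w0, E_TS, m).
    """
    E = np.asarray(E, dtype=float)
    log_w = np.log(np.asarray(w_eff, dtype=float))
    best = None
    for e_ts in np.linspace(1.01 * E.max(), span * E.max(), n_grid):
        x = np.log1p(-E / e_ts)
        slope, intercept = np.polyfit(x, log_w, 1)
        resid = np.sum((log_w - (slope * x + intercept)) ** 2)
        if best is None or resid < best[0]:
            best = (resid, np.exp(intercept), e_ts, 1.0 / slope)
    return best[1], best[2], best[3]

# Synthetic Morse-type levels (assumed constants), measured from the minimum
n = np.arange(16)
levels = 2000.0 * (n + 0.5) - 20.0 * (n + 0.5) ** 2
E_mid = 0.5 * (levels[1:] + levels[:-1])   # midpoint energies
w_eff = np.diff(levels)                    # level spacings
w0, E_TS, m = fit_eq4(E_mid, w_eff)
```

For Morse-type input the scan should return m close to 2 and E_TS close to the dissociation energy, which makes it a convenient sanity check before applying the same procedure to measured progressions. The grid resolution limits the precision of E_TS; a production fit would refine the scan or use a nonlinear least-squares routine.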

The dependence of the m parameter on potential shape is illustrated in Fig. 2 (see supplementary text for further details). The lower limit is m = 2, the Morse oscillator, where the asymptote is approached infinitesimally with r. The other limit is a truncated harmonic oscillator where the potential abruptly becomes constant at E_TS. In such a case, ω^eff falls instantly to zero and m = ∞.

Dissociation versus isomerization 

The physical arguments presented here regarding the behavior of ω^eff versus E (and therefore Eq. 4) pertain only up to E = E_TS. Above that energy, ω^eff can either remain at zero for an unbound system (as with the Morse oscillator) or rise again (in a bound system). The above-barrier behavior of ω^eff depends on the outer walls of the potential and is not described by Eq. 4. The mth root form of Eq. 4 suggests the presence of a branch point at E_TS, which separates the above-barrier and below-barrier eigenspectra into two distinct energy regions (19, 24).

A semiclassical analysis of long-range interatomic potentials of the form D − (C/r^n) was performed by LeRoy and Bernstein (10) more than 40 years ago. They derived an expression that relates the change in energy per quantum number (i.e., the effective frequency) near the dissociation limit to a quantity proportional to [1 − (E/D)]^[(n+2)/2n]. This expression is clearly similar to our effective frequency formula, but the two models treat dynamically and mathematically distinct regimes. For inverse power-law potentials where n = {1, 2, 3, 4, ...}, the corresponding m values are {2/3, 1, 6/5, 4/3, ...}. In the limit n → ∞, the effective m value approaches 2 from below. In contrast, our model has a lower limit of m = 2. In other words, these two similar effective frequency expressions treat essentially disjoint classes of potentials. The key difference is how the stationary point (or dissociation limit) is approached. For long-range potentials with inverse power-law forms, the stationary point at r → ∞ is approached only polynomially. Our treatment considers potentials where stationary points are local maxima and are therefore approached over a finite domain. The common system, the Morse potential, has a stationary point at r → ∞ but approaches it exponentially (i.e., faster than any power law) and is in some sense simultaneously long-range and local. Graphically, the dynamical distinction corresponds to positive curvature (LeRoy-Bernstein) versus negative curvature (our model) on a Birge-Sponer plot, with the linear plot of the Morse oscillator dividing the two regimes.
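The correspondence between the two exponents can be checked directly: equating 1/m with the LeRoy-Bernstein exponent (n + 2)/2n gives m = 2n/(n + 2). The short sketch below (the function name is hypothetical) reproduces the m values listed above with exact fractions and confirms that they stay below 2.

```python
from fractions import Fraction

def leroy_bernstein_m(n):
    """Effective shape parameter m for a long-range potential D - C/r**n,
    obtained by setting 1/m equal to the exponent (n + 2)/(2n)."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return Fraction(2 * n, n + 2)

m_values = [leroy_bernstein_m(n) for n in (1, 2, 3, 4)]
# m_values == [Fraction(2, 3), Fraction(1, 1), Fraction(6, 5), Fraction(4, 3)]
```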

Practical application 

Fig. 4. The (ℓ = 0) effective frequency analysis for HCN and HNC. Shown are experimental data points (blue), Dunham polynomial expansion predictions using only experimental data (green), and the assigned ab initio data points (red) (26) (see supplementary text for details). The fitted E_TS parameters using Eq. 4 (blue) are compared with the ab initio barrier heights (red). A one-dimensional cut through the potential energy surface is shown as a red dashed line. The unusual shapes of the HNC potential and ω^eff plot near 5000 cm⁻¹ result from interaction with a low-lying excited diabatic electronic state (44).

What is the best way to extract the desired saddle point energy from spectroscopically measured quantities? From the frequency-domain spectrum we measure the energies of a series of quantized vibrational levels, and take the average E and difference ω^eff of adjacent level energies to obtain a set of (E, ω^eff) data points (25). Equivalently, time domain spectroscopy may provide alternative or more direct ways to obtain vibrational periods or frequencies versus energy, especially for larger systems. A plot of these (E, ω^eff) data reveals any dynamical trend in ω^eff, whether constant, linear, or nonlinear. If a stationary point is present within the data range of E, the plot will dip to a minimum as the energy of the stationary point is approached, although in a bound quantum system there will never be a point with ω^eff equal to zero.
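The bookkeeping in this procedure is simple enough to write down explicitly. The sketch below is a minimal illustration, with hypothetical function names and invented level energies that are not taken from any molecule: it converts a progression of levels into (E, ω^eff) pairs and reports the pair with the smallest spacing as a first estimate of where the dip lies.

```python
def effective_frequencies(levels):
    """Return (midpoint energy, spacing) pairs for adjacent levels."""
    ordered = sorted(levels)
    return [((lo + hi) / 2.0, hi - lo) for lo, hi in zip(ordered, ordered[1:])]

def dip_minimum(levels):
    """Return the (E, spacing) pair with the smallest level spacing."""
    return min(effective_frequencies(levels), key=lambda pair: pair[1])

# Invented term values: spacings shrink, pass through a minimum, then recover
levels = [0, 100, 190, 265, 315, 375, 450]
pairs = effective_frequencies(levels)
dip = dip_minimum(levels)
```

The minimum of the discrete spacings only brackets the stationary point; the fitted E_TS from Eq. 4 is what provides the quantitative estimate.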

In principle, only this bare minimum of information
is necessary to apply Eq. 4. A value
for E_TS can then be obtained, which in favorable
cases should have an uncertainty of 5 to
10% of the effective frequency, depending on 
the extent and quality of the input data. High 
resolution and detailed spectroscopic assign- 
ments are not a requirement, nor are ab initio 
calculations, although these can help to iden- 
tify the active vibrations and the nature of the 
transition state. Even in larger molecules, where 
full spectroscopic and computational analyses 
are impractical, the problem must simplify to a 
very small number of vibrational modes that 
form the reaction coordinate and lead to the 
transition state. These active vibrations reveal 
themselves by their isomerization dips. 

To demonstrate the capabilities of our method, we applied the model to two prototypical isomerizing molecules, HCN and the S₁ state of C₂H₂. These systems have been spectroscopically characterized in great detail, such that we can apply the isomerization dip method and Eq. 4 to them with confidence. We emphasize that the levels of knowledge and quality of data available for these systems are not necessary in general for the application of our method. Furthermore, despite the small sizes of HCN and C₂H₂, they exhibit many of the complications expected in larger molecules.

The examples of S₀ HCN ↔ HNC
and S₁ C₂H₂

The potential surface for the electronic ground state of the [H,C,N] system has two minima: the linear HCN and HNC isomers, separated by approximately 5200 cm⁻¹. The reaction coordinate of the bond-breaking HCN-HNC isomerization corresponds mainly to the ν₂ bending vibration, and the barrier to isomerization is nearly 17,000 cm⁻¹ above the HCN minimum. Extensive experimental term values for both isomers are available up to 10,000 cm⁻¹ above the HCN minimum (26). To continue the analysis up to and beyond the barrier energy, we used spectroscopically assigned ab initio eigenenergies (26-28). In (26), levels with high bending excitation were reported to deviate unexpectedly from effective Hamiltonian predictions, reflecting the presence of the double-well potential.

The S₁ state of C₂H₂ supports cis and trans conformers, with the cis conformer lying about 2672 cm⁻¹ above the trans. As illustrated in Fig. 3, the transition state is planar and nearly half-linear (29). The bare saddle point energy is calculated to be 4979 cm⁻¹ above the trans minimum, but with an uncertainty of hundreds of cm⁻¹, even for the most accurate calculations to date (30). A torsional isomerization path might have been expected on the basis of cis-trans isomerizations in other molecules, but this is not found here.

The height of the barrier relative to the fundamental frequencies leads us to expect at least some normal vibrational structure, even in the shallower cis well. Thus far, several cis vibrational levels have been identified, in reasonable agreement with ab initio calculations (31-33). In the trans well, almost all of the vibrational levels below the barrier have been assigned (33). Of the six trans conformer vibrational modes, four are fairly well behaved: the Franck-Condon active vibrations ν₂ (CC stretch) and ν₃ (trans bend) (34), and the CH stretching modes ν₁ and ν₅ (35, 36). On the other hand, a large portion of the trans vibrational manifold can only be understood within the framework of bending polyads B^n, with n = v₄ + v₆ (37), because of the Darling-Dennison and Coriolis interactions between the low-frequency ungerade bending modes, ν₄ (torsion) and ν₆ (cis bend).

Despite the success of the polyad model in reproducing the level structures associated with the bending vibrations, there are disturbing exceptions. As illustrated in figure 13 of (38), the series of 3^n B^2 polyads exhibits a surprising trend, with the energy of the lowest member of the polyad decreasing rapidly relative to the energies of the other polyad members. Although inexplicable by conventional models, this occurrence turns out to be intimately related to the isomerization dynamics discussed here.

Determination of the barrier height 

We now apply the isomerization dip concept, and in particular Eq. 4, to the barrier-proximal energy levels discussed above. Figure 4 shows the results of the pure bending (ω₂^eff) effective frequency analysis for HCN-HNC. The barrier heights for both wells are found to be within 1% of the ab initio values. To compare E_TS to calculated barrier heights, either the ab initio zero point energy must be subtracted from the calculated barrier height, or an effective zero point energy must be added to the fitted E_TS value (see supplementary text). A consistency check of the HCN-HNC analysis stems from another dynamical parameter that affects ω₂^eff and the effective barrier height: the vibrational angular momentum, ℓ. The fitted E_TS barrier heights are summarized in table S5 and, as expected, the barrier height increases approximately quadratically with ℓ.

Figure 5A shows plots of ω₃^eff and ω₆^eff for the 3^n 6^2 series of C₂H₂, where Eq. 4 can be seen to fit the observed data very well (see tables S1 and S2 for details of the fits). The 3^n 6^2 levels experience the effects of the barrier most strongly, whereas the 3^n levels are completely uninfluenced by it, because a combination of q₃ and q₆ is required to access the transition state geometry. Both ω₃^eff and ω₆^eff can be obtained as a function of v₆ as well, reading the array of term values in table S2 horizontally rather than vertically. The same {E, ω^eff} data are obtained, but in different sets.

Reaction path analysis 

Several possibilities arise when the ω^eff analysis is extended to additional vibrational progressions. The first, shown in Fig. 5C, is that of differentiating between isomerization pathways. We have noted already that certain progressions, such as the 3^n of trans-C₂H₂, show no sign of an isomerization dip. Here we see that, surprisingly, the 3^n 4^2 levels exhibit the same ω₃^eff as the 3^n, from which we conclude that the torsional cis-trans isomerization pathway is closed at these energies. This observation is consistent with the harmonic behavior in ν₄ noted in (37). Furthermore, the very strong interactions between ν₄ and ν₆, as well as the inevitable evolution of the torsion as the molecule straightens, would have led us to predict some kind of nonlinear behavior of the ω₃^eff from the 3^n 4^2 levels. The absence of any such effects implies that the torsion is a spectator mode independent of the isomerization occurring in ν₃ and ν₆. Unlike in many other molecules, torsion does not play a role in this cis-trans isomerization. It appears that the residual π bond, despite its incomplete complement of substituents, leads to the preference for in-plane isomerization (Fig. 3).

Fig. 5. The ω^eff analysis for S₁ C₂H₂. The dip is expected near the ab initio value of 4979 cm⁻¹. Data are from tables S1 to S3. (A) Experimental ω^eff for the 3^n 6^2 levels, shown with fits to Eq. 4. ω₃^eff is obtained directly from the progression of 3^n 6^2 levels, and ω₆^eff is derived from the 3^n 6^2 and 3^n 6^1 levels at a given n₃. (B) Experimental ω₃^eff for the 3^n 6^2 levels as compared to ω₃^eff for other progressions with varying quanta of ν₆, shown with fits to Eq. 4. The 3^n 6^2 series has the sharpest dip, in close analogy to the K = ℓ = 0 series in the “Dixon dip” (12). (C) ω₃^eff plots for the 3^n, 3^n 4^2, and 3^n 6^2 progressions. The 3^n 4^2 series follows the normal behavior of the 3^n levels, despite the isomerization dip observed in the 3^n 6^2 levels. This shows that the torsional mode ν₄ is not involved in the isomerization process at these energies.

The clear distinction between spectator modes and isomerizing modes suggests that other properties of the saddle point may be obtainable from ω^eff analysis. Consider a separable system that consists of an asymmetric double minimum in x and a harmonic oscillator in y: V(x, y) = V(x) + (ky²/2). Because the Hamiltonian is separable, E = E_x + E_y, and therefore ω_x^eff does not depend on y, in the same way that ω₃^eff does not depend on v₄ in trans-C₂H₂. This means that the ω_y of the transition state is unchanged from that of the minimum. We can then imagine a case where [∂²V(x, y)/∂y²] varies with x. In such a case it is possible to extract a value for ω_y of the transition state from the spacing of the ω_x^eff curves, or by plotting ω_y^eff directly, including data from above the saddle point energy (39). To give a specific example from a rotational degree of freedom, a quadratic fit of the ℓ-dependence of the 0v₂0 HCN barrier height (table S5) yields (A − B)_TS = 11.1 (±0.9) cm⁻¹ for the transition state, which agrees well with the ab initio value of 12.2 cm⁻¹. These hitherto unmeasurable transition state rotational constants and frequencies, in addition to the saddle point energy, are critical inputs to the expression for the rate constant in transition state theory (2).

The ideas about spectator modes articulated here are confirmed by analysis of the HCN-HNC stretching mode progressions; neither shows any dip up to 19,000 cm⁻¹ above the HCN minimum. Furthermore, the shifts in the fitted barrier heights for ν₂ progressions built upon excitation in ν₁ and ν₃ match well with the one-dimensional pseudopotentials U_{v₁,v₃}(0) from (40-42) and the stretching frequencies (table S4).

Implications and outlook 

The method described here provides qualitative 
and quantitative information about the isomeri- 
zation mechanism and the transition state solely 
on the basis of experimental data. The reaction 
coordinate can be identified from the isomerization dips shown by active vibrations, and quantitative information about the energy, vibrational frequencies, and rotational constants of the transition state becomes available. It is especially promising that this analysis stems entirely from a small subset of the vibrational levels; in other words, a full vibrational analysis is not necessary.
Special states exist that encode chemically im- 
portant information, although it may not always 
be easy to recognize them. 

The most exciting outcome of the excellent fits 
using Eq. 4 is the determination of the transition 
state energies. The uncertainties in E_TS are at
least as good as what is currently available from 
theory. In many cases, experiments might only 
confirm theoretical predictions, but we expect 
them to sharpen our understanding, and the 
potential value of our method goes well beyond 
validating theory. For example, we envision that 
this approach is a step toward establishing ki- 
netics on the same firm experimental foundation 
as that already enjoyed by thermochemistry, 
where precise information is broadly available. 
This could aid in modeling complex reaction net- 
works as well as provide benchmarks for theo- 
retical calculations. 

The next stages in developing the concepts of 
isomerization dip and effective frequency will 
require more experimental data for model sys- 
tems as well as refinement of the basic ideas. For 
example, it is clear that ω^eff is a multidimensional quantity, which could be treated by suitable multidimensional analysis. However, it is not yet clear how to define E in many dimensions. On the experimental side, the MgNC ↔ MgCN isomerization (43) is similar to HCN ↔ HNC, although
with a much lower barrier; with higher resolution 
than has been used so far, it should be possible to 
get detailed information on the isomerizing levels 
all the way to the barrier and beyond. 

The most promising prospect is the application 
of this approach to larger molecular systems. For 
example, the principles presented in this work 
will provide a framework for experimental char- 
acterization of the transition state and detailed 
mechanism in other reactions, such as cis-trans 
isomerization, as epitomized by stilbene, and 
1,2 hydrogen shifts, which are ubiquitous in or- 
ganic chemistry. Remarkably little frequency- 
domain spectroscopy has been done at the high 
energies relevant to chemical kinetics. Partly this 
is because it was not clear previously what could 
be learned, and partly it is because the spectra 
rapidly become very complicated at such ener- 
gies, so it is difficult to recognize the important 
patterns. In time it is likely that new classes of 
experiments will sample the information about 
isomerization that is encoded in time- or frequency- 
domain spectra of larger molecules. Our hope is 
that the concepts and examples described here 
will be viewed as templates for the characterization 
of isomerizing systems, thereby challenging and 
guiding spectroscopists to attack similar problems 
of chemical interest.

SIGNAL PROCESSING

Subnoise detection of a fast 
random event 

V. Ataie, D. Esman, B. P.-P. Kuo, N. Alic, S. Radic 

Observation of random, nonrepetitive phenomena is of critical importance in astronomy, 
spectroscopy, biology, and remote sensing. Heralded by weak signals, hidden in noise, they 
pose basic detection challenges. In contrast to repetitive waveforms, a single-instance signal 
cannot be separated from noise through averaging. Here, we show that a fast, randomly 
occurring event can be detected and extracted from a noisy background without conventional 
averaging. An isolated 80-picosecond pulse was received with confidence level exceeding 99%, 
even when accompanied by noise. Our detector relies on instantaneous spectral cloning 
and a single-step, coherent field processor. The ability to extract fast, subnoise events is expected 
to increase detection sensitivity in multiple disciplines. Additionally, the new spectral-cloning 
receiver can potentially intercept communication signals that are presently considered secure. 



The spontaneous decay of a molecule (1), a
fast radio-astronomy burst (2), or the arrival
of a secure communication packet (3) are 
examples of nonrepetitive events that pose 
similar challenges for observers. In addition 
to requiring a fast and sensitive detector, a 
randomly occurring signal must be extracted 
from the background noise in either the optical 
or electrical domain. If the event is repetitive, 
this noise can be substantially eliminated by 
classical averaging (4). When noise is temporally 
uncorrelated, the summation of a sufficiently 
large number of signal instances can lead to 
an arbitrary signal-to-noise ratio (SNR) improve- 
ment (4). Simple to implement, averaging plays 
a critical role in the capture of fast, weak, or 
noisy signals. A powerful generalization of the 
averaging concept was developed for cyclosta- 
tionary signals accompanied by spectrally uncor- 
related noise (5). In contrast to temporal averaging, 
cyclostationary and cumulant analysis recognizes 
that noise spectral components can be rejected if 
the signal is cyclically modulated. Intuitively, a 
similar treatment could be extended to a single- 
instance signal if it can be temporally replicated. 
In earlier work (6), a recirculating loop was used 
to create multiple, serialized signal outputs that 
can be independently detected and subsequently 
averaged. Such replication requires an amplifier 
to overcome coupling losses, thus injecting excess 
(amplification) noise. Unfortunately, all known 
temporal replication mechanisms (6, 7) are inher- 
ently lossy and are subject to this basic limitation. 

However, a single event can be replicated to 
frequency nondegenerate copies (spectral clones) 
in a nearly noiseless manner (8). We show that 
an isolated pulse can be extracted from noise by 
single-step processing of its spectral clones. To
demonstrate single-event detection, an 80-ps-long
pulse was cloned and processed in order to in- 
crease the received SNR by 14.1 dB. To separate 
signal from noise, the experiment varied the 
number of spectral copies from 6 to 36, proving 
that background rejection can be progressively 
increased even when the random event is immersed 
in a high level of background noise. 

When a repetitive signal s(t) is accompanied by noise n(t), its estimate can be reached by coherent summation (4):

$$\frac{1}{N}\sum_{k=1}^{N}\int_{t_k}^{t_k+\Delta T}\left[s(t) + n(t)\right]dt,$$

where t_k and ΔT indicate the signal instance and observation (detection) interval, respectively. For a sufficiently large occurrence count (N ≫ 1), when noise is a zero-mean uncorrelated process (4), this estimate can be arbitrarily accurate because

$$\frac{1}{N}\sum_{k=1}^{N}\int_{t_k}^{t_k+\Delta T} n(t)\,dt \approx 0.$$

In the case of a single event (N = 1), noise cannot be repeatedly measured but can still be discriminated from the signal in the spectral domain. Indeed, whereas two distinct signal spectral components have a deterministic relation, the same is not true for noise components. This important difference was recognized (5) and used to discriminate δ-correlated noise from the signal. In the simplest
cyclostationary receiver implementation, the prod- 
uct of two spectral components is integrated over 
many signal cycles in order to acquire the cor- 
relation between any pair of spectral components. 
The correlation collapses when only noise is re- 
ceived, thus allowing for signal feature extraction (5). 
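The benefit of classical averaging for repetitive signals can be illustrated numerically. The sketch below is an assumption-laden toy model, not the receiver described here: it draws zero-mean, uncorrelated Gaussian noise for N repeated instances and measures the residual noise after averaging, which should fall as 1/√N (an SNR power gain of N). The function name and all parameter values are hypothetical.

```python
import numpy as np

def averaged_noise_std(n_instances, n_samples=2000, sigma=1.0, seed=0):
    """Standard deviation of zero-mean uncorrelated noise after averaging
    `n_instances` independent instances, sample by sample."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, sigma, size=(n_instances, n_samples))
    return float(noise.mean(axis=0).std())

single = averaged_noise_std(1)      # close to sigma
averaged = averaged_noise_std(100)  # close to sigma / 10
gain_db = 20.0 * np.log10(single / averaged)  # roughly 10*log10(100) = 20 dB
```

With a single instance (N = 1) no such reduction is available, which is the limitation that motivates discriminating noise in the spectral domain instead.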

Cyclostationary detection fails to reject the 
noise accompanying an isolated (single-cycle) 
event because no correlation can be drawn at 
any time instance beyond the event duration. 
However, by replicating such a signal in a sub- 
stantially noiseless manner, one can still exploit 
the fact that the signal possesses spectral cor- 
relation, whereas the received noise does not 
(Fig. 1). To describe noise discrimination in this case, let us assume that a set of lossless narrow filters with bandwidth δf can be constructed over the full span of a Δf-wide signal (Fig. 1). The spectral decomposition (9) of the received field r(t) = s(t) + n(t) can be described by the short-time Fourier transform (STFT)

$$R(t, k\,\delta f) = \int w(t - \tau)\, r(\tau)\, e^{-j 2\pi k\, \delta f\, \tau}\, d\tau,$$

where w(t) = sin(πδf t)/(πt) is the δf-wide band-pass window function. STFT is a slowly varying complex function that, when sampled within the detection interval [t ± 1/(2δf)] (9), represents the received spectral component centered at f_k as the phasor sum R_k = S_k + n_k. In the absence of noise, its magnitude is a measure of the signal spectral density S(f_k); in the absence of the signal, the magnitude and phase of this phasor are defined by the statistics of the interfering noise field.

The STFT cannot be realized by mere filtering centered at f_k because it will result in the carrier-specific phase rotation 2πf_k t. To address this, the differential phasor rotation must be stopped by frequency shifting each output to the baseband (9). Consequently, phasor summation across the entire bandpass set leads to a noise-sensitive outcome. For a noiseless, transform-limited signal pulse, this summation results in collinear vector addition. In contrast, when only noise is present this summation resembles a random walk in the complex plane (Fig. 1, inset X). Consequently, the effective SNR increase provided by a spectrally cloning detector should scale with the replica count N as ~ (N/√N)² (10).
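The scaling argument can be checked with a small Monte Carlo sketch. It idealizes each spectral component as a unit-magnitude phasor, which is an assumption made here for illustration only: signal phasors are taken as perfectly aligned, noise phasors as having independent uniformly random phases. The function name and trial count are hypothetical.

```python
import numpy as np

def phasor_sum_power(n_clones, noise_only, trials=4000, seed=1):
    """Mean squared magnitude of a sum of `n_clones` unit phasors.

    Aligned phasors model the noiseless transform-limited signal
    (collinear addition); random phases model noise (random walk).
    """
    rng = np.random.default_rng(seed)
    if noise_only:
        phases = rng.uniform(0.0, 2.0 * np.pi, size=(trials, n_clones))
    else:
        phases = np.zeros((trials, n_clones))
    total = np.exp(1j * phases).sum(axis=1)
    return float(np.mean(np.abs(total) ** 2))

n = 36
signal_power = phasor_sum_power(n, noise_only=False)  # grows as N**2
noise_power = phasor_sum_power(n, noise_only=True)    # grows as N on average
ideal_gain = signal_power / noise_power               # approaches N
```

In this idealization the power ratio grows linearly with the number of clones, matching (N/√N)² = N; a measured gain would be expected to fall short of this bound because of implementation penalties that the toy model ignores.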

Although the single-event detector is concep- 
tually simple, its realization faces a set of basic 
challenges. In the first of these challenges, the 
spectral decomposition process must be distor- 
tionless and lossless. This requirement is easily 
satisfied for slow, microsecond-scale events that 
can be quantized with high precision (11, 12). 
In this case, the digitized field can be used to 
calculate the sampled STFT and emulate the 
coherent subband summation (Fig. 1). However, 
when the event is fast (subnanosecond), signal
quantization imposes a fundamental resolution
limit (11, 12), eliminating such a computational
approach. Its alternative, physical channelization
(13), is neither distortionless nor lossless.

Recognizing these limits, we mapped the received
signal onto widely separated frequency
carriers (ΔF) and performed spectral decomposition
by means of a strictly periodic bandpass
process (Fig. 1). In the optical domain, the received
spectrum can be parametrically cloned (8) and 
subsequently decomposed by a single physical 
filter (14). This strategy cannot be applied in 
spectral ranges where efficient, low-noise para- 
metric mixers (15, 16) do not exist. Among those 
ranges, the microwave range (0.3 to 100 GHz) is 
arguably the most important because it hosts 
commercial (17), scientific (17), and defense (17, 18) 
signals. To construct a single-event detector 
operating in this band, we mapped the received 
field onto a highly coherent frequency comb 
(Fig. 2). 

Fig. 1. Single-event noise discrimination. A set of lossless bandpass filters (δf) spectrally decomposes the signal and noise fields into a set of S_k and n_k
phasors. In the reference frame of the signal carrier, these phasors are not synchronous and rotate at different rates ω = 2πf_k (k = 1, ..., N). Spectral cloning (SC)
maps the received field onto a widely spaced grid (ΔF) and enables lossless spectral decomposition by means of a single periodic process (filter). To coherently
sum signal (noise) phasors, differential phasor rotation must be arrested by means of a frequency-invariant baseband process.

In the first experiment, a single-instance signal
was represented by a 12-GHz-wide pulse and was
cloned to six spectral replicas. The pulse was centered
at 6.5 GHz and combined with broadband
noise generated by a photodiode that receives 
amplified spontaneous emission from an erbium- 
doped fiber amplifier. The signal and noise were 
combined and used to modulate a tunable optical 
frequency comb (TOC1), replicating the noise-loaded
signal onto its frequency modes. Each
mode had an optical SNR in excess of 40 dB, measured
within a 12.5-GHz bandwidth and at 1550 nm.
TOC1 had a continuously variable frequency pitch
(19) that was set to ΔF = 50 GHz. The second
(Vernier) optical comb (TOC2) was used as a local-oscillator
(LO) array, with frequency pitch differing
by δf = 2 GHz from the signal comb. Both
optical combs were seeded by a single master os- 
cillator with a spectral linewidth of 3 kHz, guar- 
anteeing a high degree of mutual coherence (20) 
during the longest signal capture time (~2 56 ps). 
The ability to continuously tune the frequency 
pitch of both optical combs is critical because it 
defines the offset between a specific spectral 
replica and the distinct LO mode. When combined 
in a detector with bandwidth δf_D, the beating between
the replica and the LO mode selects a
δf_D-wide spectral segment that is centered f_k =
k × δf away from the replica carrier. If the detector
bandwidth and the comb frequency offset
are matched (δf_D = δf), the comb-assisted cloning
becomes equivalent to the spectral decomposition
(Fig. 1). Two wavelength-demultiplexing elements
(WDMs) were used to route the spectrally overlapping
copy and LO mode to a coherent detector
(D). A matched detector bandwidth, defined by
the ratio of the pulse bandwidth (Δf = 12 GHz)
and the spectral replica count N = 6, defines the
frequency offset between the signal (TOC1) and
Vernier (TOC2) combs, δf = Δf/N = 2 GHz. Last,
the output of each detector D is sampled and used
to perform coherent summation in order to discriminate
the pulse from the noise, as detailed in (21).

Fig. 2. Spectral cloning receiver. Two frequency-tunable optical combs (TOC1 and TOC2) are
derived from a single master laser (ML). A single pulse is combined with noise and used to modulate
TOC1 with frequency pitch ΔF. The Vernier comb TOC2, with frequency pitch ΔF + δf, serves as local
oscillator array and is combined with the received signal at the subrate detector array (D).
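
The Vernier arrangement described above can be sketched numerically. The snippet below is an illustration only: it assumes that comb modes are numbered k = 1..N from the shared seed line and that the LO comb pitch is the signal pitch plus the offset; the function names are hypothetical.

```python
def vernier_offsets_ghz(n_replicas=6, signal_pitch=50.0, pitch_offset=2.0):
    """Return (signal_mode, lo_mode, beat_offset) for each comb line, in GHz.

    Frequencies are measured relative to the shared seed (master) line.
    signal_pitch plays the role of the signal-comb pitch and pitch_offset
    the pitch difference between the two combs (both values from the text).
    """
    rows = []
    for k in range(1, n_replicas + 1):
        signal_mode = k * signal_pitch
        lo_mode = k * (signal_pitch + pitch_offset)
        # The k-th LO line walks off the k-th replica by k * pitch_offset,
        # so each detector sees a different slice of the replicated spectrum.
        rows.append((signal_mode, lo_mode, lo_mode - signal_mode))
    return rows

def matched_offset_ghz(pulse_bandwidth=12.0, n_replicas=6):
    """Matched condition from the text: offset = pulse bandwidth / N."""
    return pulse_bandwidth / n_replicas

offsets = [row[2] for row in vernier_offsets_ghz()]
```

With the values quoted in the text, the six offsets step through the 12-GHz pulse bandwidth in 2-GHz increments, which is why a single detector bandwidth can serve every replica.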

1344 11 DECEMBER 2015 • VOL 350 ISSUE 6266 sciencemag.org SCIENCE

RESEARCH | REPORTS

[Plot residue: Fig. 3A, horizontal axis SNR_IN (dB), vertical axis SNR_OUT (dB).]

Fig. 3. Single pulse detection. (A) An 80-ps pulse with varied noise level was
detected by generating and processing six spectral replicas (red curve). The inset
illustrates pulse presence (5) and absence (0). The blue curve indicates the
performance of the ordinary receiver. (B) Sensitivity scaling. Spectral replica count
was varied to perform 6-, 12-, 24-, and 36-copy detection. Curve family E corresponds
to coherently summed replicas; S is conventional detection; the corresponding
receiver’s bandwidth (BW) is shown for each spectral copy count.
(C) Time reconstructed and scatterplot of the detected random pulse by (1) conventional
and (2) 36 coherently summed replicas.

The first measurement generated six spectral
copies of an 80-ps pulse accompanied by various
levels of interfering noise (Fig. 3A). The input
SNR (SNR_in) was defined as the ratio of the
signal and noise powers, measured within the
observation interval N/Δf = 500 ps. We performed
4720 detections to quantify the ability to distinguish
signal from noise. Measurements (Fig. 3A)
are shown with each point in the scatter plot 
corresponding to a single detection outcome. 
Intuitively, the detection of a low-noise pulse 
corresponds to highly distinguishable ensemble 
scatters. As an example, the measurement ensemble
5 (Fig. 3A) corresponds to the detection of
a signal with SNR_in = 22.3 dB, whereas in 0,
the signal was absent. Less overlap between
these scatters reflects a higher confidence level
(22) with which one can differentiate between
pulse presence and absence, as detailed in (21). To
quantify the spectral-cloning receiver performance,
we define the output SNR (SNR_out) as the ratio
between the square of the scatter mean separation
and its variance. As an illustration, a pulse
with SNR_in = 7.9 dB is detected with SNR_out =
11.9 dB, indicating a 4-dB increase in the
level of detection confidence. To compare the
performance of the spectral cloning and conventional
detectors, we did not perform coherent
decomposition and summation in a subsequent
set of measurements. Instead, the output of each
detector was recorded and its average was plotted
(Fig. 3A). The increase in SNR_out between the two
cases varies from 7.1 dB (SNR_in = 22.3 dB) to 7.5 dB
(SNR_in = 18.3 dB), which is in agreement with the
value predicted previously for six (N = 6) spectral
replicas, (N/√N)² ≈ 7.8 dB.
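
The output-SNR definition above (squared separation of the scatter means divided by the scatter variance) can be written as a short routine. This is a sketch under stated assumptions: the text does not say which ensemble's variance is used, so the code averages the variances of the pulse-present and pulse-absent ensembles, and the synthetic data are purely illustrative.

```python
import math
import random

def output_snr_db(present, absent):
    """SNR_out = |separation of ensemble means|^2 / variance, in dB.

    `present` and `absent` are lists of complex detection outcomes.
    Assumption: the variance is the average of the two ensemble variances.
    """
    def mean(xs):
        return sum(xs) / len(xs)

    def variance(xs):
        m = mean(xs)
        return sum(abs(x - m) ** 2 for x in xs) / len(xs)

    separation_sq = abs(mean(present) - mean(absent)) ** 2
    pooled_var = 0.5 * (variance(present) + variance(absent))
    return 10.0 * math.log10(separation_sq / pooled_var)

# Synthetic example: 4720 detections per ensemble, unit-variance noise,
# and a hypothetical signal phasor of amplitude 4.
rng = random.Random(0)

def noise():
    s = 0.5 ** 0.5
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))

present = [4.0 + noise() for _ in range(4720)]
absent = [noise() for _ in range(4720)]
snr_out = output_snr_db(present, absent)
```

For this synthetic input the result should sit near 10·log10(16) ≈ 12 dB, since the mean separation is about 4 and the variance about 1.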

Last, to test the main hypothesis, which predicts 
that the sensitivity increases with spectral clone 
count, we constructed receivers with 12, 24, and 
36 replicas. To circumvent the physical scaling 
of the subrate detector array, interfering noise 
was synthesized by using a 64-GS/s digital-to- 
analog-converter (DAC) and combined with a 
12-GHz-wide pulse, as described in (21). Clone- 
scaling measurements (Fig. 3B) indicate that an 
average sensitivity gain of 10.4, 13.1, and 14.1 dB 
was achieved when 12, 24, and 36 copies were 
generated, respectively. These measurements agree 
remarkably well with the predicted sensitivity gains 
(N/√N)² ≈ 10.8, 13.8, and 15.6 dB, when N = 12,
24, and 36, respectively. 
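
As a quick arithmetic check of the figures above, the predicted gain (N/√N)² = N can be converted to decibels and set against the measured averages. The dictionary below simply restates the measured values from the text; the variable names are illustrative.

```python
import math

# Measured average sensitivity gains quoted in the text (dB), keyed by N.
measured_db = {12: 10.4, 24: 13.1, 36: 14.1}

def predicted_gain_db(n):
    # (N / sqrt(N))^2 simplifies to N; express the power ratio in dB.
    return 10.0 * math.log10((n / math.sqrt(n)) ** 2)

for n, measured in measured_db.items():
    predicted = predicted_gain_db(n)
    print(f"N={n:2d}  predicted={predicted:4.1f} dB  "
          f"measured={measured:4.1f} dB  "
          f"shortfall={predicted - measured:3.1f} dB")
```

The predicted values reproduce the 10.8, 13.8, and 15.6 dB quoted above, and the gap to the measurements widens from roughly 0.4 dB at N = 12 to roughly 1.5 dB at N = 36.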

These results prove the importance of low-noise, 
low-distortion spectral replication for the detec- 
tion of a random, nonrepetitive signal. Although 
we describe the detection of a single pulse, it is 
not difficult to recognize that spectral cloning 
can be used for considerably more complex an- 
alyses of random events, in both the microwave 
and optical domains. Besides discriminating 
noise, the ability to accurately replicate a fast, 
random event onto an arbitrary frequency 
map paves the way for low-latency, computation-free
Fourier processing beyond simple coherent
summation.

SURFACE SCIENCE

Electron-hole pair excitation
determines the mechanism
of hydrogen atom adsorption

Oliver Bünermann,1,2,3* Hongyan Jiang,1 Yvonne Dorenkamp,1
Alexander Kandratsenka,1,2 Svenja M. Janke,1,2 Daniel J. Auerbach,1,2 Alec M. Wodtke1,2,3

How much translational energy atoms and molecules lose in collisions at surfaces 
determines whether they adsorb or scatter. The fact that hydrogen (H) atoms stick to 
metal surfaces poses a basic question. Momentum and energy conservation demands that 
the light H atom cannot efficiently transfer its energy to the heavier atoms of the solid 
in a binary collision. How then do H atoms efficiently stick to metal surfaces? We 
show through experiments that H-atom collisions at an insulating surface (an adsorbed 
xenon layer on a gold single-crystal surface) are indeed nearly elastic, following the 
predictions of energy and momentum conservation. In contrast, H-atom collisions with 
the bare gold surface exhibit a large loss of translational energy that can be reproduced 
by an atomic-level simulation describing electron-hole pair excitation. 



Adsorption of atomic hydrogen (H) is the
simplest reaction in surface chemistry. 
Langmuir’s study of this reaction ushered 
in the era of modern surface science (1). 
Hydrogen adsorption is important for many 
fields, ranging from heterogeneous catalysis (2) 
to interstellar molecular hydrogen production 
(3). Adsorbed H atoms can stabilize surfaces of 
intrinsically reactive solids, healing dangling bonds 
and making them suitable for industrial process- 
ing (4). Adsorption is also central to hydrogen 
storage technologies (5), and it is the basis for a 
chemical means of manipulating the band gap 
in graphene (6). 

Despite more than a century of study, we still 
do not have a fundamental understanding of how
H-atom adsorption takes place. Adsorption in- 
volves the H atom coming to rest at the surface, 
losing its initial translational energy, and dissi- 
pating the energy of the chemical bond formed 
with the solid (Fig. 1A). Because of its light mass, 
energy and momentum conservation requires 
that the transfer of H-atom translational energy 
to heavy surface atoms is inefficient; for example,
an H atom colliding with a gold atom at a
Au(111) surface is expected to transfer only 2%
of its translational energy per collision (Fig.
1B). How then can the H atom lose sufficient
translational energy to adsorb? As early as 1979, 
speculations were made, supported by theoret- 
ical analysis, that the mechanism of H-atom ad- 
sorption at metals could involve the conversion 
of H-atom translational energy to electronic excitation
of the solid (7). This requires a failure of the
Born-Oppenheimer approximation (BOA), which
assumes that electronic motions are much faster
than nuclear motions and can be treated separately
(8). Although failure of the BOA is not
without precedent (for example, infrared linewidths
of chemisorbed H atoms on metals are
believed to be broadened by electronic interactions
(9), and “chemicurrents” have been detected
at Schottky diode junctions (10-12)), there
are no experimental measurements of the translational
inelasticity of H atoms with any solid.
Moreover, translational excitation of electron-hole
pairs occurring because of collisions of
atoms or molecules with surfaces has never been
observed in the absence of efficient phonon
excitation (13).

Previous experiments on BOA failure showed
that highly vibrationally excited molecules exhibit
efficient vibrational relaxation when they
collide with a clean single-crystal metal surface,
whereas little relaxation is seen with insulators
(14, 15). This comparison showed the importance
of electronic excitation by molecular vibration, a
phenomenon that could also be investigated with
first-principles theory (16, 17). Although vibrational
relaxation studies tell us nothing about
adsorption, they suggest an approach to the
problem. If BOA failure were important in
H-atom adsorption, we would expect inelastic
H-atom scattering from metals and insulators
to exhibit dramatic differences in their translational
energy loss; furthermore, we could only
describe the inelasticity with modern theoretical
methods that account for electronic excitation
(18-20).

[Fig. 1 panels: (A) horizontal axis: distance of H atom from surface; (B) $E_{\mathrm{fin}} = E_{\mathrm{in}} \left( \frac{m_1 - m_2}{m_1 + m_2} \right)^2$]

Fig. 1. Adsorption of H atom requires loss of translational energy. (A) The incident H atom must
lose its initial translational energy, E_in, and dissipate the chemical potential energy, E_0, that it discovers in
binding to the surface. (B) Conserving linear momentum and translational energy in a simple collinear
binary collision model leads to a simple relation between E_in and the final kinetic energy of the H atom,
E_fin, that depends only on the masses of the atoms. For the example of H (m_1 = 1) colliding with Au (m_2 =
198), the H atom retains 98% of its initial energy.

Experiments probing inelastic H-atom scattering
from surfaces are extremely challenging.
Previous studies on H-atom scattering from solids 
used discharge-based H-atom sources and, in some 
cases, electromagnetic velocity filters (21, 22). These 
approaches yield relatively broad H-atom veloc- 
ity distributions that peak at low translational 
energies. Detecting H atoms is also challenging: 
Bolometers (22), photographic plates (23), and 
ZnO conductivity detectors (24) were sensitive 
enough to observe surface scattering, but their 
slow temporal response precludes the study of 
inelastic scattering. These experimental limita- 
tions help explain why, since the first successful 
observations of H atom scattering from sur- 
faces (23), additional studies have measured spa- 
tially resolved diffraction rather than inelastic 
scattering. 

Here, we show that the translational energy 
loss of H atoms colliding at a metal surface pre- 
dominantly results from electronic excitation of 
the solid. We produced nearly monoenergetic 
incident beams of H atoms by laser photolysis, 
the energy of which can be varied (25, 26), and 
obtained scattering-angle resolved, translational 
energy loss spectra by the Rydberg-atom neutral
time-of-flight (TOF) method (27). Our measurements
show that collisions of H atoms at
metal surfaces are strongly inelastic. In contrast, 
H-atom collisions at an insulator are nearly elastic. 
For the insulator, the small inelasticity can be 
understood as a simple binary collision between 
a light and heavy atom where linear momentum 
is conserved. For H-atom energy loss at a metal, 
we used a recently developed full-dimensional 
molecular dynamics (MD) method (20) capable 
of describing both excitation of the solid lattice 
and electron-hole pairs. This model gives good 
agreement with experimental results. Switching 
off electron-hole pair excitation in the simulations 
resulted in energy loss far less than observed. 

A schematic diagram of our apparatus (Fig. 2) 
shows the pulsed molecular beam expansion that 
efficiently cooled HI to its ground state, where 
ultraviolet (UV) laser photolysis produced nearly 
mono-energetic H atoms. A small fraction of these 
atoms passed through two differential pump- 
ing chambers (not shown), entered an ultrahigh 
vacuum (UHV) chamber, and collided with a gold 
(Au) single crystal. The incidence angles, ϑ_i and
φ_i, were varied by tilting the Au crystal, which
was held in a six-axis UHV manipulator. Recoiling
H atoms were subjected to Rydberg tagging
(27); that is, they were excited by two laser pulses
to the long-lived n = 34 Rydberg state, which
lies just ~10 meV below the ionization level. 
These neutral atoms passed a detector aperture 
and traveled 25 cm in a field-free region, and 
then through a grounded wire mesh, to encounter 
a weak (~7 kV/cm) ionizing field just in front of 
an ion counting detector. H-atom TOFs were 
recorded by a multichannel scaler. The detector
could be rotated so that TOF data could be obtained
at many scattering angles, ϑ_s.

The Au surface was cleaned by cycles of Ar- 
ion sputtering and annealing at 1000 K. Auger 
electron spectroscopy (AES) and low-energy elec- 
tron diffraction (LEED) were used to determine 
the cleanliness and orientation of the Au(111) surface.
The Au sample could also be cooled to 45 K
with cold gaseous He, allowing Xe condensation.
We used a 300-Langmuir exposure (10⁻⁶ mbar
Xe gas for 5 min) to produce a thick Xe layer (an
insulating surface) whose structure was not in- 
fluenced by the underlying Au crystal. Warming 
easily removed the Xe layer, allowing H-atom 
scattering measurements from metal and insu- 
lator to be made within minutes of one another. 

Figure 3A shows representative TOF data for 
H-atom scattering from Au (open squares) and 
solid Xe (filled squares). The scattering conditions
were E_in = 2.76 eV, ϑ_i = 45°, ϑ_s = 45°, and φ_i = 0°
with respect to the [101̄] direction. Figure 3B
shows the translational energy loss distributions 
derived from the TOF data using the appropriate 
Jacobian. The inset shows the measured transla- 
tional energy distribution of the incident H atoms. 
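
The conversion from TOF data to an energy distribution mentioned above can be sketched as follows. This is not the authors' analysis code: it assumes that the recorded time is the flight time over the 25-cm field-free path alone, uses the standard density transformation P(E) = P(t)·|dt/dE| as the Jacobian, and introduces hypothetical function names.

```python
import math

M_H_KG = 1.6735e-27         # mass of a hydrogen atom, kg
EV_IN_J = 1.602176634e-19   # joules per electronvolt
FLIGHT_M = 0.25             # 25-cm field-free flight path from the text

def tof_to_energy_ev(t_s, length_m=FLIGHT_M):
    """Kinetic energy (eV) of an H atom that covers length_m in t_s seconds."""
    v = length_m / t_s
    return 0.5 * M_H_KG * v * v / EV_IN_J

def energy_to_tof_s(e_ev, length_m=FLIGHT_M):
    """Flight time (s) of an H atom with kinetic energy e_ev."""
    v = math.sqrt(2.0 * e_ev * EV_IN_J / M_H_KG)
    return length_m / v

def energy_density(p_t, t_s, length_m=FLIGHT_M):
    """Convert a TOF density P(t) [per s] into an energy density P(E) [per eV].

    With E = m L^2 / (2 t^2), the Jacobian is |dt/dE| = t^3 / (m L^2).
    """
    dt_de = t_s ** 3 / (M_H_KG * length_m ** 2)   # seconds per joule
    return p_t * dt_de * EV_IN_J                   # per eV
```

Under these assumptions an H atom that keeps the full 2.76 eV needs roughly 11 μs for the 25-cm path, and slower (more inelastic) atoms arrive later, which is consistent with the 10- to 20-μs range of the TOF axis in Fig. 3.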

There is a stunning difference in the observed 
H-atom inelasticity for scattering from metallic 
Au and an insulating Xe layer. The most prob- 
able energy loss for H-atom scattering from solid 
Xe was 46 meV, somewhat lower than that ex- 
pected for a collinear binary elastic collision be- 
tween a H and a single Xe atom (83 meV, shown 
as a vertical arrow in Fig. 3). For H-atom scatter- 
ing from gold, the average energy loss was 20 
times as high (910 meV). This energy loss is far 
too large to be compatible with the expectation 
for a H/Au binary collision model (56 meV), yet 
it is still far too small to be the result of H-atom 
trapping followed by thermal desorption. Fur- 
thermore, in contrast to the H/Xe scattering, which 
shows a very specific energy loss, the energy loss 
distribution for H scattering from Au was re- 
markably broad, extending out to at least 2.0 eV, 
suggesting that a broad continuum of acceptor 
states in the solid contributes to the translational 
inelasticity. These remarkable observations are 
compelling evidence that H-atom translational 
energy is efficiently converted to electronic ex- 
citation in collisions with solid gold. 
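
The two binary-collision estimates quoted above (83 meV for Xe and 56 meV for Au) follow from the collinear elastic collision relation of Fig. 1B, E_fin = E_in [(m_1 − m_2)/(m_1 + m_2)]². The sketch below evaluates it; the standard atomic masses are inputs assumed here, not values taken from the text.

```python
def retained_fraction(m_projectile, m_target):
    """E_fin / E_in for a collinear elastic collision with a target at rest."""
    return ((m_projectile - m_target) / (m_projectile + m_target)) ** 2

def energy_loss_mev(e_in_ev, m_projectile, m_target):
    """Energy transferred to the target atom, in meV."""
    return 1000.0 * e_in_ev * (1.0 - retained_fraction(m_projectile, m_target))

# Standard atomic masses in u (assumed inputs).
M_H, M_AU, M_XE = 1.008, 196.97, 131.29
E_IN = 2.76  # eV, incidence energy used in the experiments

loss_au = energy_loss_mev(E_IN, M_H, M_AU)
loss_xe = energy_loss_mev(E_IN, M_H, M_XE)
```

The heavier the target, the smaller the transfer, so the lighter Xe atom takes up more energy than Au, yet both estimates remain far below the 910-meV average loss measured on gold.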

Although a binary electronically adiabatic collision
model is sufficient to understand the essence
of the H-atom scattering from solid Xe,
more involved theory is needed to treat H-atom
scattering from a metal (18, 19). Accurately describing
metal atom motion and electron-hole
pair excitation are the two key challenges. Recently,
we have developed an approach to MD
simulations that self-consistently treats mechanical
energy transfer to Au lattice motion
and electronic excitation (20). The MD is carried
out on a global full-dimensional potential
energy surface (PES) based on effective medium
theory (EMT) fitted to ab initio electronic
energies. Because the EMT intrinsically contains
the embedded electron densities, we can
self-consistently describe electronically nonadiabatic
behavior on the level of the local-density
electronic friction approximation (LDFA)
(28), with no adjustable parameters. We performed
MD calculations for several million
trajectories, enough to make comparisons with
the measurements of angle-resolved inelastic
scattering.

Fig. 2. Schematic of the experimental apparatus. A molecular beam of rotationally cold HI is formed
in a pulsed molecular beam expansion. After skimmer 1, the HI beam is crossed by the dissociation laser
beam. A small fraction of the H-atom photoproducts pass skimmer 2, pass through two differential
pumping stages (not shown), and enter a UHV chamber. Here, they hit the surface of a Au single crystal
held on a six-axis manipulator that allows the variation of the polar (ϑ_i) and azimuthal (φ_i) incidence
angles. Scattered H atoms are tagged in a two-step process: first, a 121.57-nm photon brings the H
atoms into the 2p state. Second, a 365.90-nm photon transfers the atoms into the n = 34 Rydberg state.
A fraction of the tagged H atoms pass the detector aperture and travel 25 cm before they reach the
detector, where their time of arrival is recorded. The detector is rotatable, allowing the variation of the
scattering angle ϑ_s.
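
The role of an electronic friction term can be illustrated with a deliberately minimal toy model. The sketch below is not the EMT-based simulation of (20): it assumes a free particle in arbitrary units subject to a constant drag coefficient, whereas in the LDFA the coefficient depends on the local electron density along the trajectory, and it omits the lattice and any fluctuating force. All names and parameter values are hypothetical.

```python
import math

def energy_fraction_lost(eta, mass=1.0, duration=1.0, steps=10000):
    """Integrate m dv/dt = -eta * v and return 1 - E_final / E_initial.

    Toy model: free particle, constant friction coefficient eta,
    arbitrary units, explicit Euler time stepping.
    """
    dt = duration / steps
    v = 1.0  # initial velocity (arbitrary units)
    for _ in range(steps):
        v -= (eta / mass) * v * dt  # drag force opposes the motion
    return 1.0 - v * v

def analytic_fraction_lost(eta, mass=1.0, duration=1.0):
    """Closed form for the same toy model: v(t) = v0 * exp(-eta * t / m)."""
    return 1.0 - math.exp(-2.0 * eta * duration / mass)
```

Setting eta to zero corresponds to switching the friction off, in which case the toy particle keeps all of its kinetic energy; any positive eta drains energy continuously along the path.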



Fig. 3. Translational inelasticity for H-atom collisions with an insulator and a metal.
(Top) Measured TOF spectra for H atoms scattered from Au(111) (open squares) and
solid Xe (filled squares). The channel width is 8 ns for Au and 4 ns for Xe. (Bottom)
Corresponding kinetic energy loss spectra obtained by Jacobian transformation of the
TOF data. The inset shows the kinetic energy distribution of the incident H-atom beam. The
vertical arrow marks the expected energy loss for a binary collision between an H and a Xe
atom. The experimental conditions are E_in = 2.76 eV, ϑ_i = 45°, ϑ_s = 45°,
and φ_i = 0°, with respect to the [101̄] direction.
[Top panel horizontal axis: ToF / μs, 10.0 to 20.0.]

Fig. 4. Comparison of the experimentally obtained kinetic energy loss spectrum to theoretical
simulations. Theoretical energy loss found when neglecting (solid black line) and including
(solid gray line) electronic excitation. Experimental energy loss for E_in = 2.76 eV is shown as
open squares. The vertical arrow marks the expected energy loss for a binary collision
between an H and an Au atom. The inset shows the incidence energy dependence, E_in, of the
experimentally derived translational inelasticity (open squares) and comparison to theory
(solid lines): E_in = 3.33 eV (blue), 1.92 eV (red), and 0.99 eV (black). Colored arrows mark
the three incidence energies. Also shown are the average final translational energies, ⟨E_fin⟩.
The scattering angles are ϑ_i = 45°, ϑ_s = 45°, and φ_i = 0° with respect to the [101̄] direction. In all cases,
the scattered H atoms remain unthermalized with the solid, emerging with a substantial fraction of their
incidence translational energy.
[Horizontal axis: E_in − E_fin / eV.]




Figure 4 shows some of these comparisons for 
H-atom scattering from Au(111). The solid black
line shows the theoretical prediction, neglecting 
electronic excitation. The narrow energy loss dis- 
tribution, peaking near the expected value for a 
binary collision of H with Au (56 meV, shown as 
a vertical arrow), clearly fails to capture the ob- 
served magnitude of the H-atom translational 
energy loss. The gray solid line shows the sim- 
ulated energy loss distribution when electronic 
excitation is included in the MD simulations at 
the level of the LDFA. Here, the theoretical en- 
ergy loss distribution captures the experimental 
result remarkably well. We have made extensive 
comparisons between experiment and theory, 
like those shown in Fig. 4 for a range of scattering
angles, ϑ_i and φ_i, and the agreement is
uniformly good.

The inset to Fig. 4 shows how the translational 
inelasticity depends on the incidence energy 
and compares to electronically nonadiabatic MD 
simulations. At all incidence energies, agree- 
ment between experiment and theory is good 
and the energy loss is dominated by electronic 
excitation. We note that the fractional energy 
loss, (E_in − ⟨E_fin⟩)/E_in = 0.33 ± 0.01, is nearly
independent of E in , meaning that electron-hole 
pair excitation remains important even at re- 
duced incidence energies. This theoretical model- 
ing confirms the qualitative statement made 
above: H-atom translational energy is efficiently 
converted to electronic excitation in collisions 
with solid gold. 
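
The near-constant fractional loss reported above lends itself to a short arithmetic illustration. The values printed below are what a fraction of 0.33 implies for the three inset energies; they are not measured numbers from the paper, and the helper name is hypothetical.

```python
FRACTION_LOST = 0.33  # (E_in - <E_fin>) / E_in reported in the text

def mean_final_energy_ev(e_in_ev, fraction_lost=FRACTION_LOST):
    """Average final energy implied by a fixed fractional energy loss."""
    return e_in_ev * (1.0 - fraction_lost)

for e_in in (3.33, 1.92, 0.99):
    print(f"E_in = {e_in:.2f} eV -> implied <E_fin> of about "
          f"{mean_final_energy_ev(e_in):.2f} eV")

# Cross-check with the 2.76-eV data: 910 meV average loss quoted earlier.
ratio_276 = 0.910 / 2.76
```

The cross-check gives a ratio of about 0.33, so the 910-meV average loss at 2.76 eV is consistent with the fractional loss quoted for the other incidence energies.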

The good agreement between experiment and 
theory is evidence for the validity of the approxima- 
tions made in the MD simulations. Furthermore, 
the ability of the simulations to reproduce these 
experiments lends weight to the predictions made 
in (20). Most interesting among these are the 
predictions that electron-hole pair excitation 
increases the sticking probability and deter- 
mines the adsorption mechanism, which occurs 
by penetration resurfacing. Here, H-atom ad- 
sorption occurs by initial population of sub- 
surface binding sites (where electronic excitation 
is most efficient) followed by migration to the 
strongest binding sites, which are at the surface. 
This work also invalidates a previous alterna- 
tive hypothesis, one where multiple electronical- 
ly adiabatic collisions resulting from a conversion 
of normal to parallel H-atom momentum lead to 
sticking (29). Inspection of individual trajecto- 
ries shows that such adsorption behavior occurs 
only when electronic excitation is included in the 
simulations (20). 

This study demonstrates the importance of 
electronic excitation in atomic scattering at metal 
surfaces and provides a valuable benchmark for 
first-principles theories of energy transfer and 
adsorption. The prospect of using an experimen- 
tally validated electronically nonadiabatic theory 
of H interactions at a solid metal is exciting and 
could lead to progress on important problems, 
including H-atom diffusion in bulk metals and 
on metal surfaces, adsorbate influences on surface
reconstruction, quantum dynamics of adsorption,
and energetic atom diffusion and surface
penetration. More generally, chemical reactions
at a metal surface are nearly always modeled 
within the adiabatic Born-Oppenheimer approx- 
imation; see, for example, (30). Our work sug- 
gests that theories of surface chemistry capable 
of describing electron excitation may be crucial 
to understanding atomic-scale motion occurring 
in surface reactions, especially if H-atom translation
is involved.

GEOPHYSICS

Viscosity jump in Earth’s mid-mantle

Maxwell L. Rudolph,1* Vedran Lekić,2 Carolina Lithgow-Bertelloni3

The viscosity structure of Earth’s deep mantle affects the thermal evolution of Earth, the 
ascent of mantle plumes, settling of subducted oceanic lithosphere, and the mixing of 
compositional heterogeneities in the mantle. Based on a reanalysis of the long-wavelength 
nonhydrostatic geoid, we infer viscous layering of the mantle using a method that allows us 
to avoid a priori assumptions about its variation with depth. We detect an increase in 
viscosity at 800- to 1200-kilometers depth, far greater than the depth of the mineral phase 
transformations that define the mantle transition zone. The viscosity increase is coincident 
in depth with regions where seismic tomography has imaged slab stagnation, plume 
deflection, and changes in large-scale structure and offers a simple explanation of 
these phenomena. 



The viscosity of Earth’s mantle controls the
rate and pattern of mantle convection and, 
through it, the dynamics of our planet’s 
deep interior, including degassing of and 
heat transport from the interior, mixing 
of compositional heterogeneity, plume ascent and 
passive upwelling, and slab descent. The long- 
wavelength nonhydrostatic geoid is a key geo- 
physical constraint on Earth’s internal viscosity 
structure. At the largest spatial scales (spherical 
harmonic degrees 2 to 7), the geoid is most sen- 
sitive to density structure and viscosity contrasts 
in the lower mantle. At smaller scales, the geoid 
becomes increasingly sensitive to upper mantle 
structure, which is primarily associated with sub- 
ducting slabs. Because lateral viscosity variations 
have minor effects on the geoid at large spatial 
scales (1, 2), though they may become more important
at shorter length scales (3), it is possible
to infer deep mantle viscous layering from
geoid observations. However, most studies of 
Earth’s mantle viscosity structure impose layer 
interfaces to be coincident with seismic velocity 
discontinuities. Thus, these studies may not re- 
solve viscous layering whose origin is distinct 
from that of pressure-induced phase changes 
(e.g., at 410- and 660-km depth), or may miss 
phase transitions not clearly associated with seis- 
mic discontinuities. 

We use the long-wavelength nonhydrostatic 
geoid to infer the mantle radial viscosity struc- 
ture in a manner distinct from that of previous 
attempts in three key ways. First, we employ a 
transdimensional, hierarchical, Bayesian inver- 
sion procedure (4) that does not specify at the 
outset the number or location of interfaces in our 
layered viscosity structure. The Bayesian approach 
is very attractive for this inverse problem because 
it yields a posterior probability distribution that 
can be analyzed to quantify uncertainties of and 
trade-offs between model parameters (e.g., layer
depth and viscosity contrast). Second, we explore
various choices for the conversion between seis- 
mic velocity anomalies and density anomalies, 
including depth-dependent conversion factors 
based on thermodynamic principles, calculated 
using HeFESTo (5). Finally, we use a recent whole- 
mantle tomographic model, SEMUCB-WM1 (6), 
developed with waveform tomography using high- 
ly accurate wave propagation computations, to 
infer mantle density structure, and a modern geoid
model based on 10 years of GRACE satellite ob- 
servations, combined with revised estimates of 
the hydrostatic flattening of Earth (7, 8). 

A posterior probability density function for the 
radial profile of viscosity is shown in Fig. 1, where 
the mean (taken in log-space) viscosity at each 
depth is shown as a purple curve. In this par- 
ticular inversion, we find evidence for relatively 
uniform viscosity throughout the upper mantle 
and transition zone. Below the mantle transition 
zone, there is a region of lower viscosity and an 
increase in viscosity between 670- and 1000-km 
depth. The preferred depth of this viscosity increase
can be inferred from Fig. 1B and is centered
about 1000 km.
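
Taking the mean in log-space, as done for the purple curve described above, amounts to a geometric mean of the sampled viscosities. A minimal sketch follows; the sample values are hypothetical and only illustrate why the distinction matters when samples span orders of magnitude.

```python
import math

def log_space_mean(viscosities_pa_s):
    """Mean taken in log10 space, i.e. the geometric mean of the samples."""
    logs = [math.log10(v) for v in viscosities_pa_s]
    return 10.0 ** (sum(logs) / len(logs))

def arithmetic_mean(values):
    return sum(values) / len(values)

# Hypothetical posterior samples of viscosity at one depth, in Pa s.
samples = [1e21, 1e22, 1e23]
geo = log_space_mean(samples)
ari = arithmetic_mean(samples)
```

For these samples the log-space mean sits at the middle decade, whereas the arithmetic mean is pulled toward the largest sample, which is why a log-space average is the natural summary for a quantity that varies by orders of magnitude.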

We carried out multiple inversions to explore 
the effects of (i) our treatment of data and model 
uncertainty, (ii) the degree of truncation of the
spherical harmonic expansion of the geoid used
to constrain our models, and (iii) the density scaling
R_ρ,S = d ln ρ / d ln V_S (Fig. 1). We consider features
of the viscosity profiles to be robust if they are 
common among the separate inversions. We find 
that all solutions place the depth of viscosity in- 
crease considerably below 670-km depth, most 
often near 1000-km depth. This result appears to 
be independent of assumptions made, including 
maximum spherical harmonic degree l_max, choice
of depth-dependent or constant R_ρ,S, or treatment
of data and model covariance (7). Other
features of the solutions are sensitive to these
choices and, therefore, their robustness is proportional
to the likelihood of the assumptions
from which they result. Inversions with l_max = 7
(dashed curves in Fig. 2) generally have a more 
pronounced peak in viscosity in the mid-mantle, 
underlain by a weaker region between 1500- and 
2500-km depth and an increase in viscosity in
the lowermost mantle. Several solutions, using
depth-dependent R_ρ,S or R_ρ,S = 0.4, feature a
lower-viscosity layer between 670- and 1000-km
depth. Some solutions include a high-viscosity
“hill” in the mid-mantle between 1000- and 1500-km
depth, separating upper and lower mantles of 
lower viscosity. 
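
The density scaling used in these inversions, R = d ln ρ / d ln V_S, converts a fractional shear-velocity anomaly from tomography into a fractional density anomaly. The sketch below shows the constant and depth-dependent variants; the function names and example numbers are illustrative assumptions, and the depth-dependent factors used in the study come from HeFESTo rather than from anything shown here.

```python
def density_anomaly(dlnvs, r_rho_s=0.4):
    """Fractional density anomaly from a fractional shear-velocity anomaly.

    Uses R = d ln(rho) / d ln(V_S), so d ln(rho) = R * d ln(V_S).
    """
    return r_rho_s * dlnvs

def density_anomaly_profile(dlnvs_by_depth, r_by_depth):
    """Depth-dependent variant: one scaling factor per depth sample."""
    return [r * dv for r, dv in zip(r_by_depth, dlnvs_by_depth)]

# Example: a +1% shear-velocity anomaly with the constant factor 0.4.
example = density_anomaly(0.01)
```

With the constant factor of 0.4 mentioned above, a 1% fast anomaly maps to a density excess of 0.4%.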

Many early studies advocated for layered man- 
tle convection with an interface at or somewhat 
below 670-km depth, and in particular Wen and 
Anderson (9) noted that the amplitude and pat- 
tern of the long-wavelength geoid and surface 
topography could be well reproduced using man- 
tle flow models with an imposed barrier to flow 
about 250 km deeper than the 670-km seismic 
discontinuity. However, tomographic images of 
relict Farallon and Tethys slabs in the lower man- 
tle suggest that the concept of layered mantle 
convection is at best incomplete, and we empha- 
size that our mantle flow calculations do not im- 
pose layered convection. 

Our results favor viscosity structures in which 
the overall increase in viscosity is a factor of 10 to 
150, in agreement with previous studies. All of our 
results favor the location (interface depth) of this 
viscosity increase lying below 670-km depth, and 
most models place this viscosity increase deeper 
still, in the vicinity of 1000-km depth. This result 
is particularly intriguing given the observation that most actively subducting slabs stagnate below the 670-km seismic discontinuity, at depths
of 1000 km (10). For instance, both the GAP-P4 
model (11) and SEMUCB-WM1 reveal slabs stag- 
nating above the 670-km discontinuity in the North- 
ern Honshu arc, but passing through the 670-km 
discontinuity and stagnating above 1000-km depth 
along the Tonga and Kermadec arcs. In at least 
one region, Central America, the slab appears to 
enter the lower mantle without stagnation. The 
mechanism responsible for this slab stagnation 
is unclear, as there is no velocity discontinuity at 
this depth in one-dimensional (1D) seismic models (12), nor a known phase transition.

Two mechanisms have been recently suggested for slab stagnation in the mid-mantle. First, King et al. (13) have suggested that the pyroxene to majoritic garnet phase transition in subducted slabs is kinetically hindered, and thus older, colder slabs are more prone to stagnation. Second, Marquardt and Miyagi (14), based on high-pressure deformation experiments of (Mg,Fe)O, argued that viscosity in the regions surrounding settling slabs in the shallow-most 900 km of the lower mantle may be about two orders of magnitude higher than previously expected, causing slabs to spread laterally and to settle very slowly through this region. Our results indicate that there may be a viscosity increase in the mid-mantle, and many of our inversions have viscosity contrasts at depths comparable to those suggested (14). However, we note that the observation of regional differences in slab behavior, and in particular the speculation that old, cold slabs preferentially stagnate, cannot be explained using our 1D viscosity structure or by a viscosity contrast that would occur in the mantle surrounding all slabs, irrespective of age, without invoking additional mantle dynamic processes or subduction zone histories, such as the prevalence of trench rollback.

Previous inversions for layered viscosity structure with prescribed layer interface depths revealed some indication of an increase in viscosity
at or around 1000-km depth. In particular, King 
and Masters (15) inverted for layered viscosity struc- 
ture constrained by the geoid using a uniform 
velocity to density conversion factor, with velocity anomalies inferred from S-wave tomographic
models, and found evidence for a viscosity in- 
crease of ~20 at 670-km depth and a second in- 
crease of ~5 at 1022-km depth. Forte and Peltier 
(16) also found, using a combination of a slab 
density model and lower-mantle tomographic 
model, that the agreement between modeled and 
observed geoid was better for a layered viscosity 
structure with an interface at 1200-km depth than 
at 670-km depth. Kido et al. (17) performed inversions for layered mantle viscosity structure (with prescribed layer depths) using a genetic algorithm and found evidence for a decrease in viscosity at 670-km depth and a subsequent increase in viscosity at 1000-km depth. Our study is different in that we do not prescribe at the outset the number or locations of layer interfaces in our layered viscosity structure and, as a result, we place the largest viscosity contrast in the model somewhat deeper than previous studies.

Fig. 1. Properties of ensemble solution. Viscosity inversion using depth-dependent R_ρ,S from HeFESTo, l_max = 3, and assumption of uncorrelated errors yields radial viscosity profiles with a viscosity increase at 1000-km depth and a lower-viscosity channel between 670 and 1000 km. (A) A 2D histogram showing the posterior likelihood of viscosity and depth values. Horizontal dotted lines indicate depths of 670 and 1000 km. (B) A 2D histogram showing the posterior likelihood of layer interface depth and viscosity increase (>1 means viscosity increases with increasing depth). (C) Posterior likelihood of having a layer interface at each depth. (D) Distribution of residuals of solutions in ensemble solution. (E) Distribution of number of layers in models in the ensemble solution.

1350 11 DECEMBER 2015 • VOL 350 ISSUE 6266

sciencemag.org SCIENCE

RESEARCH | REPORTS

Many studies from the 1980s and 1990s em- 
ployed layered structures with layering identical 
to that of the tomographic models then available 
(~11 layers), or layered structures with layers at 
the major seismic discontinuities. Subsequent 
models have introduced additional layers [for 
instance, 25 in (18)]. To justify such parameter- 
izations, either additional observational constraints, 
such as rates of glacial isostatic adjustment, plate 
motions, or patterns of seismic anisotropy, or 
additional assumptions about the smoothness 
of the mantle viscosity structure, are required. 
Paulson et al. (19, 20) used geoid and relative sea-level data as constraints on a Monte-Carlo inversion for mantle viscosity structure with one, two,
and three layers. One of the central conclusions 
was that the GRACE and relative sea-level data 
cannot be used to uniquely constrain a layered 
mantle viscosity structure with more than two 
layers. Two markedly different two-layer models 
were permitted by these inversions (with prescribed 
interface depth at 670 km), one having an upper 
mantle with viscosity around 5 × 10^20 Pa·s and a lower mantle ~4.33 times more viscous and the other having an upper mantle viscosity about an order of magnitude smaller and a viscosity contrast of ~1500, similar to what was found by Ricard et al.
(21). Our results generally support the suggestion 
that the geoid alone cannot uniquely constrain 
the viscosity of more than a handful of layers. In- 
deed, many individual models in the posterior 
population for each of our inversions do have 
more than five layers (e.g., Fig. 1), but owing to 
trade-offs, the layer properties of these more com- 
plex structures cannot be uniquely constrained. 
The posterior distribution of solutions inherently 
captures these trade-offs between model param- 
eters, and the precise viscosity structures of these 
inversions are largely dependent on assumptions 
in the inversion (7). 

A viscosity contrast at 1000-km depth has im- 
portant implications for the dynamics of convec- 
tion in Earth’s mantle, including its thermal and 
chemical evolution. As ascending plumes en- 
counter abrupt changes in viscosity (in numeri- 
cal models), they can be laterally deflected and 
thinned. Similarly, downwellings in numerical simu- 
lations become elongated laterally and compressed 
vertically as they encounter viscosity increases. 
Deflection of upwellings is observed in some tomo- 
graphic models. For instance, recent tomographic 
images obtained by full waveform tomography 
with sophisticated forward-modeling approaches 
reveal apparent deflection at 1000-km depth of 
the seismically slow structures both regionally 
beneath the Iceland hotspot (22) and globally (23). 
Indeed, examples of apparently deflected upwellings, such as the feature beneath the Macdonald hotspot in the South Pacific (Fig. 3), are globally
not uncommon (23). In both studies (22, 23), the 
apparent radius of plumes also decreases from 
the lower to the upper mantle. The decrease in 
radius appears to be coincident with the deflec- 
tion at 1000-km depth. Upwelling structures in 
numerical simulations of mantle convection with 
an imposed increase in viscosity at 1000-km depth 
show similar behavior (Fig. 3). 

Other studies use the mantle radial correlation 
function (24) to analyze tomographic models and 
to compare tomographic and geodynamic models 
(24, 25). Radial correlation functions calculated 
for SEMUCB-WM1, as well as for the global P-wave 
tomographic model GAP-P4 (10) for spherical 
harmonic degrees 1 to 3 (Fig. 4, A and B), show 
a high degree of correlation throughout the 
lower mantle at depths greater than 1000 km and a rapid decrease in correlation at 1000-km
depth. Nearly identical behavior is also present
in the average of S-wave tomographic models
SMEAN (25) (fig. S10). Other tomographic mod- 
els show a change in radial correlation around 
this depth as well as a change in velocity hetero- 
geneity, particularly at spherical harmonic degree 
4 (25), and an independent test based on voxel 
tomography favors a vertical coherence minimum 
around 800-km depth, below the base of the 
transition zone (26). 
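
The radial correlation described above can be illustrated with a minimal sketch. It assumes a normalized inner product of spherical harmonic coefficients at two depths, which is one common way to define such a correlation; the exact normalization used in (24) is not reproduced here, and the array layout, function name, and synthetic two-layer model are all hypothetical.

```python
import numpy as np


def radial_correlation(coeffs):
    """Correlation between every pair of depth slices of a model.

    coeffs has shape (n_depths, n_coeffs). Row i holds the real spherical
    harmonic coefficients at depth i for the degrees of interest
    (hypothetical layout chosen for this sketch).
    """
    coeffs = np.asarray(coeffs, dtype=float)
    norms = np.linalg.norm(coeffs, axis=1)
    # Normalized inner product: 1 on the diagonal, near 0 for unrelated slices.
    return (coeffs @ coeffs.T) / np.outer(norms, norms)


# Synthetic model: degrees 1 to 3 give 3 + 5 + 7 = 15 coefficients per depth.
rng = np.random.default_rng(0)
shallow = rng.normal(size=15)
deep = rng.normal(size=15)
# Two shallow slices share one pattern; two deep slices share another.
model = np.vstack([shallow, shallow, deep, deep])
R = radial_correlation(model)
print(np.round(R, 2))
```

In this toy model the correlation stays at 1 within each pair of slices and drops sharply between the second and third slice, which is the kind of abrupt decorrelation across a depth that the tomographic models show near 1000 km.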

Changes in the radial correlation function may 
be related to changes in viscosity. Numerical simu- 
lations of convection in spherical shell geometry 
show that endothermic phase changes (24) and 
depth-dependent viscosity can both cause corre- 
sponding changes in the radial correlation. We 
find that a viscosity increase at 1000 km (Fig. 4C) yields a radial correlation structure much more similar to that found in tomographic models (Fig. 4, A and B) than does a viscosity increase at 670 km (Fig. 4D). The rapid change in radial correlation at 1000-km depth in tomographic models thus suggests a contrast in viscosity, because no change in phase is known to occur at this depth. We emphasize that these models include simplified representations of mantle viscosity structure (fig. S7) and that a more gradual increase in viscosity may also be compatible with the observations. Other, more complex viscosity structures can also alter the behavior of upwellings and downwellings and consequently change the radial correlation structure. Convection simulations run with a “second asthenosphere,” a weak zone extending from 670- to 1000-km depth as suggested in some of our inversions (Fig. 1) as well as in inversions by Kido et al. (17), show a greater tendency toward layered convection (27), which promotes decorrelation.

Fig. 2. Results from multiple inversions. Mean radial profiles of viscosity obtained in eight inversions varying R_ρ,S, l_max, and eliminating buoyancy contributions from the lowermost 1000 km of the mantle (denoted by a superscript “a”) all exhibit an increase in viscosity between 670- and 1000-km depth. Models with l_max = 7 are characterized by low viscosity in the mid-lower mantle.

Fig. 3. Observed and modeled upwellings. (A) Shear velocity anomaly isocontours delineate deflected upwellings at 1000-km depth (horizontal line) near Macdonald hotspot in SEMUCB-WM1. [...] show similar deflection and thinning of upwellings in a numerical geodynamic model with a viscosity increase at 1000-km depth. Cool and warm colors trace dimensionless temperature variations in (B) and denote seismically fast or slow regions in (A).

Fig. 4. Radial correlation functions of tomographic and geodynamic models. (A) Radial correlation functions for spherical harmonic degrees 1 to 3 from SEMUCB-WM1 and (B) GAP-P4 show an abrupt decorrelation of structure across 1000-km depth. Very similar radial correlation functions are seen in the temperature field from numerical mantle convection simulations with imposed plate motions, including a viscosity contrast at 1000-km depth (C), but not when the viscosity contrast is smaller and shallower, at 670-km depth (D).

The viscosity contrast at 1000-km depth provides a physical mechanism for the observation
that slabs and plumes stagnate or become deflected 
deeper than the transition zone in the absence of 
a pervasive compositional barrier or another en- 
dothermic phase change. It may also reconcile 
observations of changes in seismic structure (28) 
that led to a proposed hot abyssal layer (29), 
though this was originally placed at greater depths. 
Given the present state of understanding in 
mineral physics, no unique mechanism can be 
identified for this increase in viscosity, and our 
observation should motivate further experimen- 
tal and computational studies. First principles cal- 
culations have indicated a continuous though 
gentle increase in the viscosity of bridgmanite due 
to greater vacancy diffusion starting at around 
40 GPa (~1000 km) and continuing until the
postperovskite phase transition (30). The increase 
in the strength of ferropericlase observed by 
Marquardt and Miyagi (14) is the first positive 
experimental evidence for a possible change in 
rheology at these depths. Whether this effect, 
which is localized in high-strain-rate regions (surrounding slabs), should be expected to contribute to the viscosity inferred on the basis of the very-long-wavelength components of the geoid, remains to be determined. The spin transition in
ferropericlase occurs at much greater depths, and 
first-principles simulations suggest that the higher- 
pressure phase (low spin) should have increased 
diffusion and lower viscosity (31), with a viscosity 
minimum near 1500-km depth (32). 

Two possible intriguing (though speculative) 
solutions remain. Changes in the relative abundance of ferric versus ferrous iron due to disproportionation (33) at these depths or gradually
over a depth range might change the bonding 
strength in bridgmanite enough to markedly 
strengthen it. Perhaps of greater interest and of 
more pervasive dynamical consequence might be 
the gradual drying of the bridgmanite perovskite 
as the solubility of water in the structure decreases 
with pressure (34), becoming more viscous at 
1000-km depth.

SUPERCONDUCTIVITY

Evidence for two-dimensional Ising 
superconductivity in gated MoS 2 

J. M. Lu,1 O. Zheliuk,1 I. Leermakers,2 N. F. Q. Yuan,3 U. Zeitler,2 K. T. Law,3 J. T. Ye1*

The Zeeman effect, which is usually detrimental to superconductivity, can be strongly protective 
when an effective Zeeman field from intrinsic spin-orbit coupling locks the spins of Cooper pairs 
in a direction orthogonal to an external magnetic field. We performed magnetotransport 
experiments with ionic-gated molybdenum disulfide transistors, in which gating prepared 
individual superconducting states with different carrier dopings, and measured an in-plane 
critical field B c2 far beyond the Pauli paramagnetic limit, consistent with Zeeman-protected 
superconductivity. The gating-enhanced B c2 is more than an order of magnitude larger than it 
is in the bulk superconducting phases, where the effective Zeeman field is weakened by 
interlayer coupling. Our study provides experimental evidence of an Ising superconductor, in 
which spins of the pairing electrons are strongly pinned by an effective Zeeman field. 



In conventional superconductors, applying a sufficiently high magnetic field above the upper critical field B c2 is a direct way to destroy superconductivity by breaking Cooper pairs via the coexisting orbital and Pauli paramagnetic mechanisms. The orbital contribution
originates from the coupling between the mag- 
netic field and the electron momentum, whereas 
the paramagnetic contribution is caused by spin 
alignment in Cooper pairs by an external mag- 
netic field. When the orbital effect is weakened 
or eliminated, either by having a large electron 
mass (1) or by reducing dimensionality (2), B c2 is
solely determined by the interaction between the 
magnetic field and the spin degree of freedom 
of the Cooper pairs. In superconductors where 
Cooper pairs are formed by electrons with oppo- 
site spins, aligning the electron spins by the ex- 
ternal magnetic field increases the energy of the 



system; therefore, B c2 cannot exceed the Clogston- 
Chandrasekhar limit ( 3 , 4) or the Pauli para- 
magnetic limit (in units of tesla), B p ~ 1.86 T c (0). 
Here, T c (0) is the zero-field superconducting 
critical temperature (in units of kelvin) that char- 
acterizes the binding energy of a Cooper pair, which 
competes with the Zeeman splitting energy. 
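
As a quick numerical check of the Pauli limit relation, the sketch below evaluates B p ≈ 1.86 T c (0) for the critical temperatures reported in this study for samples D1 and D24, and compares the result with two in-plane fields quoted in the measurements (20 T and 32.5 T). The function and variable names are illustrative only.

```python
PAULI_COEFF_T_PER_K = 1.86  # tesla per kelvin, from B_p ~ 1.86 T_c(0)


def pauli_limit(tc_kelvin):
    """Pauli paramagnetic limit in tesla for a zero-field T_c in kelvin."""
    return PAULI_COEFF_T_PER_K * tc_kelvin


# Zero-field critical temperatures quoted in the text (kelvin).
for tc in (2.37, 7.38, 7.64):
    print(f"T_c(0) = {tc:.2f} K  ->  B_p = {pauli_limit(tc):.1f} T")

# Ratios of quoted in-plane fields to the corresponding Pauli limits.
ratio_low_tc = 20.0 / pauli_limit(2.37)   # B_c2 above 20 T at 1.46 K
ratio_high_tc = 32.5 / pauli_limit(7.64)  # highest applied field, 32.5 T
print(f"{ratio_low_tc:.2f}, {ratio_high_tc:.2f}")
```

The values reproduce the Pauli limit of 13.7 T stated for T c (0) = 7.38 K, and the ratios agree with the statements that B c2 exceeds four times B p in the low-T c state and that 32.5 T corresponds to about 2.3 B p.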

However, in some superconductors, the Pauli 
limit can be surpassed. For example, forming 
Fulde-Ferrell-Larkin-Ovchinnikov states with in- 
homogeneous pairing densities favors the presence 
of a magnetic field, even above B p (5). In spin- 
triplet superconductors, the parallel-aligned spin 
configuration in Cooper pairs is not affected by 
Pauli paramagnetism, and B c2 can easily exceed
B p (6-8). Spin-orbit interactions have also been
shown to align spins to overcome the Pauli limit. 
Rashba spin-orbit coupling (SOC) in noncentro- 
symmetric superconductors will lock the spin to 
the in-plane direction, which can greatly enhance 
the out-of-plane B c2 (9); however, for an in-plane 
magnetic field, B c2 can only be moderately enhanced to √2 B p (10). Alternatively, electron spins
can be randomized by spin-orbit scattering (SOS), 
which weakens the effect of spin paramagnetism 
(11-16) and hence enhances B c2 . 

Superconductivity in thin flakes of MoS 2 can 
be induced electrostatically using the electric field 
effect, mediated by moving ions in a voltage-biased 
ionic liquid placed on top of the sample [section 1 of 
(16); (17)]. Negative carriers (electrons) are induced
by accumulating cations above the outermost layer 
of an MoS 2 flake, forming a capacitor ~1 nm thick 
(17-22). The potential gradient at the surface 
creates a planar homogeneous electronic system with an inhomogeneous vertical doping profile, where conducting electrons are predominantly doped into a few of the outermost layers, forming superconducting states near the K and K' valleys of the conduction band (Fig. 1A). The in-plane inversion symmetry breaking in a MoS 2 monolayer can induce SOC, manifested as a Zeeman-like effective magnetic field B eff (~100 T) oppositely applied at the K and K' points of the Brillouin zone (23). Because electrons of opposite momentum experience opposite B eff , this SOC is then compatible with Cooper pairs also residing at the K and K' points (24). Therefore, spins of electrons in the Cooper pairs are polarized by this large out-of-plane Zeeman field, which is able to protect their orientation from being realigned by an in-plane magnetic field, leading to a large in-plane B c2 . This alternating spin configuration also provides the essential ingredient for establishing an Ising superconductor, where spins of electrons in the Cooper pairs are strongly pinned by an effective Zeeman field in an Ising-like fashion.

Fig. 1. Inducing superconductivity in thin flakes of MoS 2 by gating. (A) Conduction-band electron pockets near the K and K' points in the hexagonal Brillouin zone of monolayer MoS 2 . Electrons in opposite K and K' points experience opposite effective magnetic fields B eff and -B eff , respectively (green arrows). The blue and red colored pockets indicate electron spins oriented up and down, respectively. (B) Side view (left) and top view (right) of the four outermost layers in a multilayered MoS 2 flake. The vertical dashed lines show the relative positions of Mo and S atoms in 2H-type stacking. In-plane inversion symmetry is broken in each individual layer, but global inversion symmetry is restored in bulk after stacking. (C) Energy-band splitting caused by B eff . Blue and red bands denote spins aligned up and down, respectively. Because of 2H-type stacking, adjacent layers have opposite B eff at the same K points. (D) The red curve (left axis) denotes the theoretical carrier density n 2D for the four outermost layers of MoS 2 (26) for sample D1, when T c (0) = 2.37 K. In the phase diagram (right axis), superconducting states with different values of T c (0) are color-coded; the same color-coding is used across all figures. Here, T c is determined at the temperature where the resistance drop reaches 90% of R N at 15 K. This criterion is different from the 50% R N criterion used in the rest of the paper; it was chosen to be consistent with that used in the phase diagram of (17). (E) Temperature dependence of R s , showing different values of T c corresponding to superconducting states (from samples D1 and D24) denoted in (D).

Because of the alternating stacking order in
2H-type single crystals of transition metal dichalcogenide (TMD) (Fig. 1B), electrons with the same momentum experience B eff with opposite signs for adjacent layers, which weakens the effect of SOC by cancelling out B eff mutually in the bulk crystal (Fig. 1C) [a comparison with bulk intercalated TMD is given in section 7 of (16)]. However, field-effect doping can strongly confine carriers to the outermost layer, reaching a two-dimensional (2D) carrier density n 2D of up to ~10^14 cm^-2 (17, 25). Theoretical calculations for our devices indicate that the n 2D of individual layers decays exponentially from the channel surface (Fig. 1D, left axis), reducing the n 2D of the second-to-outermost layer by almost 90% in comparison with the outermost one (26). From the established phase diagram (17), if superconductivity is induced close to the quantum critical point (QCP; n 2D ≈ 6 × 10^13 cm^-2), the second layer is not even metallic, because metallic transport can be observed only when n 2D > 8 × 10^12 cm^-2. Therefore, the outermost layer is well isolated by gating, mimicking a freestanding monolayer (27).
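
The isolation argument above is simple arithmetic, sketched here under one stated assumption: that the exponential decay can be approximated by each layer retaining 10% of the carrier density of the layer above it (the text says the reduction is "almost 90%"). The constant and function names are hypothetical.

```python
QCP_DENSITY_CM2 = 6e13         # outermost-layer n_2D near the QCP (from the text)
METALLIC_THRESHOLD_CM2 = 8e12  # minimum n_2D for metallic transport (from the text)
LAYER_RATIO = 0.1              # assumed: each layer keeps ~10% of the one above


def layer_densities(n_outer, n_layers=4, ratio=LAYER_RATIO):
    """Carrier density per layer for a geometric (exponential) decay."""
    return [n_outer * ratio**k for k in range(n_layers)]


densities = layer_densities(QCP_DENSITY_CM2)
metallic = [n > METALLIC_THRESHOLD_CM2 for n in densities]

for index, (n, is_metallic) in enumerate(zip(densities, metallic), start=1):
    print(f"layer {index}: n_2D = {n:.1e} cm^-2, metallic: {is_metallic}")
```

With these numbers the second layer carries about 6 × 10^12 cm^-2, below the 8 × 10^12 cm^-2 threshold, so only the outermost layer is metallic.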
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We obtained superconducting states across a 
range of doping concentrations (Fig. 1D, right axis) by varying the gate voltage (17); these states have different temperature dependences of sheet resistivity R s (Fig. 1E). A superconducting state [T c (at B = 0) = 2.37 K] at the onset of superconductivity (close to QCP) could be induced without suffering from the inhomogeneity usually encountered at low doping concentrations (Fig. 1E, red curve). Consistently, this well-behaved state also exhibits a high mobility of ~700 cm^2/Vs (measured at T = 15 K) before reaching zero resistance.

Angle-resolved photoemission spectroscopy 
(ARPES) measurements (27, 28) and theoretical 
calculations (25, 29) both showed that electron 
doping starts near the K points of the conduction 
band. The band structure is modified at higher 
doping (25, 29), meaning that the simplest super- 
conducting states in MoS 2 , which are dominated 
by Cooper pairs at the K and K' points, should be 
prepared by minimizing doping [higher doping 
states are discussed in section 7 of (16)]. 







Fig. 2. 2D superconductivity in gated MoS 2 (sample D24). Temperature dependence of R s under a constant out-of-plane (A) and in-plane (B) magnetic field, up to 12 T. In (B), the left inset shows a close-up view of the data near R N /2 within 1 K. In the right inset, θ is the angle between the B field and the MoS 2 surface. (C) Angular dependence of R s , where the dashed line denotes R s = R N /2. In the inset, the data are shown in detail within ±1° of the in-plane field configuration (θ = 0°). (D) Angular dependence of B c2 , which is fitted by both the 2D Tinkham model (red) and the 3D anisotropic GL model (blue). In the inset, the angular dependence of B c2 is shown in detail within ±1° of the in-plane field configuration (θ = 0°). (E) The V-I relationship at different temperatures close to T c , plotted on a logarithmic scale. The black lines are fits close to metal-superconductor transitions. The long black line denotes V ∝ I^3, which gives T BKT . (F) Temperature dependence of α from fitting the power-law dependence V ∝ I^α from the black lines in (E). T BKT = 6.3 K is obtained for α = 3.
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The charge distribution of our gated system 
implies that the superconducting state thus formed should exhibit a purely 2D nature. To demonstrate this dimensionality, we have characterized sample D24 [with T c (0) = 7.38 K] with a series of measurements. The temperature dependences of R s under out-of-plane and in-plane magnetic fields (Fig. 2, A and B) are highly anisotropic. The angular dependence of B c2 at T = 6.99 K (Fig. 2D) was extracted from Fig. 2C. Curves fitted with the 2D Tinkham formula (red curve) (30) and the 3D anisotropic Ginzburg-Landau (GL) model (blue curve) (2) show that for |θ| > 1° (where θ is the angle between the B field and the MoS 2 surface), the data are consistent with both models, whereas for |θ| < 1° (Fig. 2D, inset), the cusp-shaped dependence can only be explained with a 2D model. These measurements show that our system exhibits 2D superconductivity, similar to LaAlO 3 /SrTiO 3 interfaces (31) and ion-gated SrTiO 3 surfaces (32). From the voltage-current (V-I) dependence at different temperatures close to T c (0) (Fig. 2E), we determined that the Berezinskii-Kosterlitz-Thouless temperature T BKT is 6.3 K for our 2D system (Fig. 2F). V-I characteristics in a magnetic field (fig. S3) exhibit similar critical behavior to the zero-field data, with their T BKT values effectively reduced by increasing the magnetic field.
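
The difference between the two angular models can be sketched numerically. The block below uses the textbook forms of the 2D Tinkham relation, |B sin θ|/B⊥ + (B cos θ/B∥)^2 = 1, and the 3D anisotropic GL relation, (B sin θ/B⊥)^2 + (B cos θ/B∥)^2 = 1, with θ measured from the sample plane. The in-plane and out-of-plane critical fields (50 T and 1 T) are hypothetical values chosen only for illustration; they are not the fitted parameters of this study.

```python
import math


def bc2_tinkham_2d(theta_deg, b_par, b_perp):
    """2D Tinkham model; theta is measured from the sample plane."""
    t = math.radians(theta_deg)
    a = abs(math.sin(t)) / b_perp
    b = (math.cos(t) / b_par) ** 2
    # Positive root of b*B^2 + a*B - 1 = 0, written to stay finite as b -> 0.
    return 2.0 / (a + math.sqrt(a * a + 4.0 * b))


def bc2_gl_3d(theta_deg, b_par, b_perp):
    """3D anisotropic Ginzburg-Landau model; theta measured from the plane."""
    t = math.radians(theta_deg)
    return 1.0 / math.sqrt((math.sin(t) / b_perp) ** 2 + (math.cos(t) / b_par) ** 2)


B_PAR, B_PERP = 50.0, 1.0  # hypothetical critical fields in tesla

for theta in (0.0, 0.25, 0.5, 1.0, 5.0, 90.0):
    tinkham = bc2_tinkham_2d(theta, B_PAR, B_PERP)
    gl = bc2_gl_3d(theta, B_PAR, B_PERP)
    print(f"theta = {theta:5.2f} deg  Tinkham = {tinkham:6.2f} T  GL = {gl:6.2f} T")
```

Both models agree at θ = 0° and θ = 90°, but the Tinkham curve falls off linearly in |θ| near the in-plane direction, producing a cusp, whereas the GL curve is rounded. This is why only data within about 1° of the plane discriminate between the two.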

A moderate in-plane B field of up to 12 T shows 
little effect on the superconducting transition 
temperature [where T c (0) = 7.38 K and the Pauli 
limit B p = 13.7 T (Fig. 2B)]; thus, the B c2 of the system must be far above B p . To confirm this, we performed a high-field measurement up to 37 T [section 2 of (16)] on sample D1 after observing a steep increase in B c2 near T c (0) = 5.5 K (Fig. 3C, green dots). By controlling the gating strength, superconducting states with T c (0) = 2.37 and 7.64 K were induced in sample D1. For T c (0) = 2.37 K, we obtained B c2 as the magnetic field required to reach 50% of the normal state resistivity (R N ) (Fig. 3A). B c2 is above 20 T at 1.46 K (Fig. 3C, red circles), which is more than four times B p . The data from the second gating [T c (0) = 7.64 K (Fig. 3B)] show only a weak reduction of T c by ~1 K at even the highest magnetic field, 32.5 T (~2.3 × B p ).

Fig. 3. Determining the in-plane upper critical field B c2 at different T c (samples D1 and D24). (A) Magnetoresistance of sample D1 [with T c (0) = 2.37 K near the onset of the superconducting phase] as a function of an in-plane magnetic field up to 37 T, at various temperatures. (B) Temperature dependence of R s for sample D1 [with T c (0) = 7.64 K] under different in-plane magnetic fields up to 32.5 T. The dashed lines in (A) and (B) indicate R N /2. B c2 is determined as the intercept between dashed lines and R s curves. (C) Temperature dependence of B c2 for superconducting states induced in sample D1 with different T c [solid circles; colors follow (D)]. The B c2 for alkali metal-intercalated bulk MoS 2 compounds is from (41) and is shown for comparison. The B c2 for gate-induced states is fitted as a function of temperature using the 2D GL (solid line) and KLB (dashed line) models. (D) B c2 normalized by B p , as a function of reduced temperature T/T c , including superconducting states from alkali-doped bulk phases and gate-induced phases (samples D1 and D24). The dashed line denotes B p and sets the boundary of the Pauli limited regime (shaded).

The temperature dependences of in-plane B c2 
for sample D1 in three different states (Fig. 3C)
are fitted using a phenomenological GL theory in 
the 2D limit (2) and the microscopic Klemm- 
Luther-Beasley (KLB) theory (12, 15, 33). The ex- 
trapolated zero-temperature in-plane B c2 is far 
beyond B p for all three superconducting states. 
The zero-temperature B c2 predicted by 2D GL 
theory, without taking spin into account, is larg- 
er than that estimated by the KLB theory, which 
considers both the limiting effect from spin 
paramagnetism and the enhancing effect by the 
SOS from disorder. To fit the data using the KLB 
theory (dashed curves in Fig. 3C), the interlayer 
coupling has to be set to zero. This strongly sug- 
gests that the induced superconductivity is 2D, 
which is consistent with the conclusion drawn 
from Fig. 2 and previous theoretical calculations 
(17, 26) and ARPES measurements (27, 28) re- 
garding predominant doping in the outermost 
layer. Curves fitted with the KLB theory yield a 
very short SOS time of ~24 fs (fig. S5), which is 
less than the total scattering time of 185 fs es- 
timated from resistivity measurements at 15 K 
(table S2) and much shorter than the estimation 
of nanoseconds calculated for MoS 2 at the car- 
rier density range accessed by this work (34). 
Short spin-orbit scattering times of ~40 to 50 fs have also been observed in organic molecule-intercalated TaS 2 (35-37), (LaSe) 1.14 (NbSe 2 ) (38, 39), and the organic superconductor κ-(ET) 4 Hg 2.89 Br 8 [ET, bis(ethylenedithio)tetrathiafulvalene] (40).

The temperature dependence of B c2 in bulk 
superconducting MoS 2 intercalated by alkali 
metals (41) near T c ( 0) is linear instead of square 
root (Fig. 3C). The slight upturn of B c2 toward 
lower temperatures away from T c ( 0) is the evi- 
dence of crossover from 3D to 2D supercon- 
ducting states (12, 33, 36-38) caused by the 
layered nature of the bulk crystal. In these bulk 
phases, the measured B c2 values are much small- 
er than or comparable (when Cs dopants are in- 
tercalated) to B p (41). 

This behavior is visualized in Fig. 3D, where 
the in-plane B c2 normalized by B p for bulk su- 
perconducting phases falls within the shaded 
area bounded by the Pauli limit. In contrast, all 
gate-induced phases (from samples D1 and D24) are far above both B p (dashed line) and bulk-phase B c2 . The D1 state with T c (0) = 2.37 K, which is
separated from the other gate-induced states, 
exhibits the largest enhancement. If the large SOS 
rate extracted from the KLB fitting (Fig. 3C) were 
the reason for the enhancement of B c2 in gate-induced phases, we would expect it to also enhance B c2 in the bulk phases. The difference shown in Fig.
3D indicates that SOS is unlikely to be the origin 
of the enhancement of B c2 in the gated phases. 

Excluding SOS as the principal mechanism for the strong enhancement of the in-plane
$B_{c2}$, and taking into account recent developments in understanding the band
structures of monolayer MoS$_2$ (42, 43), we propose that this $B_{c2}$ enhancement is
mainly caused by the intrinsic spin-orbit coupling in MoS$_2$. Near the K points of the
Brillouin zone (Fig. 1A) and in the basis of spin-up and spin-down electrons, the
normal-state Hamiltonian of monolayer MoS$_2$ in the presence of an external field can
be described by (24)

$$H(\mathbf{k} + \epsilon\mathbf{K}) = \varepsilon_{\mathbf{k}} + \epsilon\beta_{so}\sigma_z + \alpha_R\,\mathbf{g}_F \cdot \boldsymbol{\sigma} + \mathbf{b} \cdot \boldsymbol{\sigma} \qquad (1)$$

Here, $\varepsilon_{\mathbf{k}} = k^2/2m - \mu$ denotes the kinetic energy with chemical
potential $\mu$; $\mathbf{k} = (k_x, k_y, 0)$ is the kinetic momentum of electrons in
the K and K' valleys; $\mathbf{K}$ is the kinetic momentum of the K valley; $m$ is the
effective mass of the electrons; $\boldsymbol{\sigma} = (\sigma_x, \sigma_y, \sigma_z)$
are the Pauli matrices; $\mathbf{g}_F = (k_y, -k_x, 0)$ denotes the Rashba vector (lying
in-plane); $\alpha_R$ and $\beta_{so}$ are the strengths of Rashba and intrinsic SOC,
respectively; $\epsilon = \pm 1$ is the valley index (1 at the K valley and -1 at the K'
valley); and $\mathbf{b} = \mu_B \mathbf{B}$ is the external Zeeman field (where $\mu_B$
is the Bohr magneton). The intrinsic SOC term $\epsilon\beta_{so}\sigma_z$, due to
in-plane inversion symmetry breaking, induces an effective magnetic field pointing out of
the plane (z direction), which has opposite signs at opposite valleys (green arrows in
Fig. 1A). This Zeeman-like effective magnetic field
$\mathbf{B}_{eff} = \epsilon\beta_{so}\hat{\mathbf{z}}/g\mu_B$ ($g$, gyromagnetic ratio;
$\hat{\mathbf{z}}$, unit vector in the out-of-plane direction) will only appear in our
multilayered system after applying a strong electric field, which isolates the outermost
layers from the other layers (17, 44), thus mimicking a monolayer system. The large
electric field generated by gating reaches ~50 million volts/cm (17) in our system,
causing additional out-of-plane inversion symmetry breaking and creating a Rashba-type
effective magnetic field $\mathbf{B}_{Ra} = \alpha_R\mathbf{g}_F/g\mu_B$.
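
To make the role of each term in Eq. 1 concrete, the following minimal Python sketch writes the spin-dependent part of the Hamiltonian as d·σ, so that the two eigenvalues are ε_k ± |d|, and evaluates the spin splitting and the tilt of the spin axis away from the out-of-plane direction. The function names, the parameterization of the momentum direction by an angle on the Fermi circle, and the 10-T example field are illustrative assumptions and do not come from the original analysis; the energies 6.2 meV and 0.88 meV are the values fitted for sample D1 in the text.

```python
import math

MU_B_MEV_PER_T = 5.788e-2  # Bohr magneton in meV per tesla


def spin_field(theta, valley, beta_so, alpha_r_kf, b):
    """Vector d (meV) such that the spin part of Eq. 1 equals d . sigma.

    theta      : direction of k on the Fermi circle in radians (illustrative)
    valley     : +1 for the K valley, -1 for the K' valley
    beta_so    : intrinsic SOC strength (meV)
    alpha_r_kf : Rashba energy alpha_R * k_F (meV)
    b          : Zeeman vector mu_B * B as (bx, by, bz) in meV
    """
    kx, ky = math.cos(theta), math.sin(theta)
    # The Rashba vector g_F = (k_y, -k_x, 0) lies in-plane.
    return (alpha_r_kf * ky + b[0],
            -alpha_r_kf * kx + b[1],
            valley * beta_so + b[2])


def spin_splitting(d):
    """Energy difference between the two eigenvalues eps_k +/- |d|."""
    return 2.0 * math.sqrt(sum(c * c for c in d))


def tilt_from_z_deg(d):
    """Angle in degrees between the spin quantization axis and the z axis."""
    return math.degrees(math.atan2(math.hypot(d[0], d[1]), abs(d[2])))


if __name__ == "__main__":
    b_inplane = (MU_B_MEV_PER_T * 10.0, 0.0, 0.0)  # assumed 10-T in-plane field
    for valley in (+1, -1):
        d = spin_field(0.0, valley, 6.2, 0.88, b_inplane)
        print(valley, round(spin_splitting(d), 3), round(tilt_from_z_deg(d), 2))
```

Under these assumptions the out-of-plane component of d changes sign between valleys, and an in-plane field much smaller than β_so/μ_B tilts the spin axis only slightly, which is the qualitative point of the argument above.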

The total energy in a magnetic field is schematically shown in Fig. 4, A to D. If the
electron spin aligned by $B_{eff}$ ($B_{Ra}$) stays parallel to the external magnetic
field $B_{ex}$ (Fig. 4, A and C), the system gains energy through coupling between spin
and external fields as $\mu_B B_{ex}$. Therefore, $B_{c2}$ is limited by $B_p$ (Fig. 4A),
or it can reach $\sqrt{2}B_p$ (Fig. 4C) when coupling is reduced in a Rashba-type spin
configuration (10). When $B_{eff}$ and $B_{Ra}$ are perpendicular to $B_{ex}$, as
respectively shown in Fig. 4, B and D, the spin aligned by both effective fields is
orthogonal to $B_{ex}$. Hence, the coupling between spin and $B_{ex}$ is minimized, and
$B_{c2}$ can easily surpass $B_p$ in these two cases.
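
For orientation, the two reference limits mentioned above can be evaluated numerically. The sketch below assumes the standard weak-coupling BCS estimate of the Pauli limit, B_p ≈ 1.86 T/K × T_c, which is not stated in this section; the function names are hypothetical, and the T_c values are those quoted in the text for samples D1 and D24.

```python
import math

PAULI_COEFF_T_PER_K = 1.86  # assumed weak-coupling BCS coefficient (tesla per kelvin)


def pauli_limit(tc_kelvin):
    """Pauli paramagnetic limit B_p in tesla for a given T_c in kelvin."""
    return PAULI_COEFF_T_PER_K * tc_kelvin


def rashba_limit(tc_kelvin):
    """Upper bound sqrt(2) * B_p for a Rashba-type spin configuration."""
    return math.sqrt(2.0) * pauli_limit(tc_kelvin)


if __name__ == "__main__":
    for label, tc in (("D1", 2.37), ("D1", 5.5), ("D24", 7.38)):
        print(f"{label}: Tc = {tc} K, B_p = {pauli_limit(tc):.2f} T, "
              f"sqrt(2) B_p = {rashba_limit(tc):.2f} T")
```

Measured in-plane $B_{c2}$ values can then be compared against these two numbers to judge how far a given state exceeds either limit.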

To theoretically describe our system when subjected to an in-plane external magnetic
field (combining the cases shown in Fig. 4, B and C), we introduced the pairing potential
terms (including their h.c.) into $H(\mathbf{k})$ and solved the self-consistent
mean-field gap equation [section 6 of (16); h.c., Hermitian conjugate]. The in-plane
$B_{c2}$ for a sample with a given $T_c$ can then be determined by including the
intrinsic SOC term $\beta_{so}$ and the Rashba energy $\alpha_R k_F$, where $k_F$ is the
Fermi momentum.

Fig. 4. Interplay between an external magnetic field and the spins of Cooper pairs
aligned by Zeeman and Rashba-type effective magnetic fields. (A to D) Illustration of the
acquisition of Zeeman energy through coupling between an external magnetic field and the
spins of Cooper pairs formed near the K and K' points of the Brillouin zone (not to
scale). When Rashba or Zeeman SOC aligns the spins of Cooper pairs parallel to the
external field, the increase in Zeeman energy due to parallel coupling between the field
and the spin eventually can cause the pair to break [(A) and (C)]. In (B) and (D), the
acquired Zeeman energy is minimized as a result of the orthogonal coupling between the
field and the aligned spins, which effectively protects the Cooper pairs from depairing.
(E) Theoretical fitting of the relationship between $B_{c2}/B_p$ and $T/T_c$ for samples
D1 [$T_c(0)$ = 2.37 K and 5.5 K] and D24 [$T_c(0)$ = 7.38 K], using a fixed effective
Zeeman field ($\beta_{so}$ = 6.2 meV) and an increasing Rashba field ($\alpha_R k_F$
ranges from 10 to ~50% of $\beta_{so}$) [section 6 of (16)]. Two dashed lines show the
special cases calculated by equation S3, when only the Rashba field ($\alpha_R k_F$ =
30 meV; $\beta_{so}$ = 0) is considered (red), and when both the Zeeman and Rashba fields
are zero (black). In the former case, a large $\alpha_R k_F$ causes a moderate increase
of $B_{c2}$ to ~$\sqrt{2}B_p$ (10). In the latter case, the conventional Pauli limit at
zero temperature is recovered. (F) Plot of $B_{c2}$ versus $T_c$ for different
superconductors [a magnetic field was applied along crystal axes a, b, or c or to a
polycrystalline (poly)]. The data shown are from well-known systems including
noncentrosymmetric (pink circles), triplet (purple squares) (6, 8, 9), low-dimensional
organic (green triangles) (40, 50-52), and bulk TMD superconductors (blue triangles)
(35-38, 47). The robustness of the spin protection can be measured by the vertical
distance between $B_{c2}$ and the red dashed line denoting $B_p$. Gate-induced
superconducting states from samples D1 and D24 are among the states with the highest
$B_{c2}/B_p$ ratio. In (LaSe)$_{1.14}$(NbSe$_2$), $T_c$ was determined at 95% of $R_N$;
$T_c$ in organic molecule-intercalated TMDs was obtained by extrapolating to zero
resistance; and all other systems use the standard of 50% of $R_N$.

For the most extensive data set from sample D1 [$T_c(0)$ = 2.37 K], the relationship
between $B_{c2}/B_p$ and the reduced temperature $T/T_c$, shown in Fig. 4E, can be fitted
well with $\beta_{so}$ = 6.2 meV and $\alpha_R k_F$ = 0.88 meV. The value obtained for
$\beta_{so}$ corresponds to an out-of-plane field of ~114 T, which is comparable to the
value expected from theoretical calculation at the K point (3 meV) (23). The Rashba
energy obtained can be regarded as an upper bound, because the present model does not
include impurity scattering, which can also reduce $B_{c2}$ (45).

The scale of $B_{c2}$ enhancement is determined by a destructive interplay between
intrinsic $\beta_{so}$ and $\alpha_R k_F$. Reaching higher $T_c(0)$ requires stronger
doping under higher electric fields, with a concomitant increase of $B_{Ra}$. As a result
of this competition, the in-plane $B_{c2}$ protection should be weakened with the
increase of $T_c(0)$. To support this argument, we chose two other superconducting states
that showed consecutively higher $T_c(0)$ (from D1 and D24). By assuming identical
$\beta_{so}$ (6.2 meV), $B_{c2}$ from D1 with $T_c(0)$ = 5.5 K and $B_{c2}$ from D24 with
$T_c(0)$ = 7.38 K can be well fitted using $\alpha_R k_F$ = 1.94 and 3.02 meV,
respectively; these values are consistent with the expected increase of $\alpha_R k_F$
with $T_c(0)$ (Fig. 4E).

The effective Zeeman field and its orthogonal protection in individual layers can also be
induced by reducing the interlayer coupling in bulk superconducting TMDs (33, 35, 38, 46,
47). Therefore, a large in-plane $B_{c2}$ was also observed in bulk when lattice symmetry
was lowered by intercalating organic molecules and alkali elements with large radii
(Cs-intercalated MoS$_2$ shows the highest $B_{c2}$ among bulk phases in Fig. 3D) or by
forming a charge density wave (46).

We compared our $B_{c2}$ results with those obtained from other superconductors with
enhanced $B_{c2}$ under their maximum spin protection along the labeled crystal axis
(Fig. 4F); we found that the Zeeman field-protected states in our samples are among the
states that are most robust against external magnetic fields. Given the very similar band
structures found in 2H-type TMDs with universal Zeeman-type spin splitting and the recent
successes in inducing more TMD superconductors using the field effect (17, 48, 49), we
would expect a family of Ising superconductors in 2H-type TMDs. The concept of the Ising
superconductor is also applicable to other layered systems, where similar intrinsic SOC
could be induced by symmetry breaking.

ICE SHEETS

Fast retreat of Zachariae Isstrom, 
northeast Greenland 

J. Mouginot,1* E. Rignot,1,2 B. Scheuchl,1 I. Fenty,2 A. Khazendar,2 M. Morlighem,1
A. Buzzi,1 J. Paden3

After 8 years of decay of its ice shelf, Zachariae Isstrom, a major glacier of northeast
Greenland that holds a 0.5-meter sea-level rise equivalent, entered a phase of accelerated
retreat in fall 2012. The acceleration rate of its ice velocity tripled, melting of its residual ice
shelf and thinning of its grounded portion doubled, and calving is now occurring at its
grounding line. Warmer air and ocean temperatures have caused the glacier to detach
from a stabilizing sill and retreat rapidly along a downward-sloping, marine-based bed.
Its equal-ice-volume neighbor, Nioghalvfjerdsfjorden, is also melting rapidly but retreating
slowly along an upward-sloping bed. The destabilization of this marine-based sector will
increase sea-level rise from the Greenland Ice Sheet for decades to come.



Zachariae Isstrom (ZI) and Nioghalvfjerdsfjorden glacier (NG), in northeast Greenland,
drain a sector 198,380 km² in size, or 12% of the Greenland Ice Sheet (1). These two
glaciers together drain the northeast Greenland ice stream, the only large, dynamic
feature that extends continuously deep to the ice sheet interior near Greenland’s summit
(2). This marine-based sector holds a 1.1-m sea-level rise equivalent (3) (Fig. 1D).

We constructed a high-resolution bed topography of both glaciers (Fig. 1) using a mass
conservation method over grounded ice (3) and airborne gravity inversion (4) over
floating ice. On ZI, we find that the grounding line in year 1996 (5) was positioned
450 m below sea level (bsl), on a previously unknown sill that crosses the entire glacier
width. Seaward of the sill, the seafloor drops to 800 m bsl (Fig. 1D). Inland of the
sill, the glacier bed remains between 400 and 700 m bsl for 30 km. The bed then rises to
reach a ridge at sea level. The ridge is cut across by a 300-m-deep channel that connects
with interior regions, where the bed remains 300 m bsl for another 150 km. On NG, the
1996 grounding line was 600 m bsl. We find no sill, and the bed is sloping upward until
45 km inland. Seismic data collected in the 1990s (6) indicate that the ice shelf floats
on a 900-m bsl cavity. The seafloor rises to 200 m bsl to the east, where the ice front
is anchored by islands and ice rises, and 600 m bsl to the north into Dijmphna Sund.

We use Landsat optical imagery (fig. S1) to document the ice-front positions over the
past 40 years. ZI ice shelf was stable until 2002-2003, when a large section broke off
(7) and ice debris cleared from Jokelbugten fjord. The ice front retreated steadily until
late 2012, when the northern and southern floating sections became disconnected. In
2013-2014, the ice-front retreat accelerated markedly and the glacier started to calve at
its grounding line. By December 2014, the remaining shelf was 52 km² in size, or 95%
smaller than in 2002. Meanwhile, the calving front of NG retreated by only a few km
between 2002 and 2012 (7).

We map the glacier grounding lines from 1992 to 2015 (Fig. 1C and figs. S2 and S3) using
differential satellite radar interferometry (DInSAR). The grounding line of ZI retreated
by 3.5 km at its center between 1996 and 2010, and 3.5 km between 2011 and 2015
(Fig. 1C). The mean rate of grounding-line retreat therefore quadrupled from 230 m/year
to 875 m/year before and after 2011. On NG, the grounding line retreated 1 km between
1992 and 2011 and has remained stable since. The DInSAR observations reveal a downward
tilting of the ice-front surface of ZI by 75 cm between 16 to 20 December and 20 to 24
December 2014 in a section 1 km wide by 7 km long (Fig. 1C). We attribute this
deformation to a buoyancy-driven rotation of the terminus depressed below flotation and
facilitated by the propagation of basal crevasses to the water line (8).
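
The quoted retreat rates follow from dividing each retreat distance by the length of its observation window. The short sketch below reproduces that arithmetic; the window lengths of 15 and 4 years are assumptions chosen to match the stated date ranges, and the function name is hypothetical.

```python
def mean_rate_m_per_year(distance_km, years):
    """Mean retreat rate in meters per year."""
    return distance_km * 1000.0 / years


early = mean_rate_m_per_year(3.5, 15)  # 1996 to 2010, assumed 15-year window
late = mean_rate_m_per_year(3.5, 4)    # 2011 to 2015, assumed 4-year window

print(f"early: {early:.0f} m/year, late: {late:.0f} m/year, ratio: {late / early:.2f}")
```

With these windows the later rate is a little under four times the earlier one, consistent with the rounded statement that the rate quadrupled.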

We document 40 years of surface velocity using 
Landsat and SAR instruments (table S1 and fig. 
S4). The results show that after 25 years of stabil- 
ity, the speed of ZI increased by 50% from 2000 
to 2014, with half of that increase taking place 
after 2012 (Fig. 2). The glacier sped up 125 m/year 
every year from 2012 to 2015, or three times as 
fast as between 2000 and 2012. NG exhibited 
smaller changes: Its speed increased by 8% be- 
tween 1976 and 2014, with most of the acceler- 
ation occurring after 2006. The glacier accelerations 
are larger than seasonal variations and extend 
80 km and 15 km upstream of the 1996 grounding 
lines of ZI and NG, respectively, indicating that 
the coastal changes affect a substantial portion of 
the drainage system (Fig. 2). 



Fig. 1. Ice speed, bed topography, and grounding lines of ZI and NG. (A) Schematic view
with Operation IceBridge and other NASA mission flight tracks in gray, basin boundaries
in black, flux gates in thick blue and red (table S2), and the profile used in Fig. 2 in
dashed black. (B) Ice surface speed from 2008 to 2009 (1), with velocity profiles used in
Fig. 3 in black dots. The profiles in (A) and (B) are different. (C) Differential
interferogram showing the tide-induced motion of ZI in December 2014. The inset shows
detail about the pattern of tidal flexure at the grounding line. (D) Bed topography above
the WGS84 ellipsoid derived from mass conservation on land (3) and gravity data at sea
(4). The seafloor bathymetry beneath NG ice shelf (square) is from seismic measurements
(6). Inset shows the drainage boundaries of three major marine-based basins in
Greenland (3).

Repeat measurements of ice thickness (±10-m precision) and surface elevation (±10-cm
precision) using radar and light detection and ranging (LIDAR) data (see the
supplementary materials) from 1995 to 2014 provide precise information about ice
thickness change during the retreat of ZI. About 4.5 km upstream from the 2014 grounding
line of ZI, the ice-thinning rate doubled from 2.5 ± 0.1 m/year, consistent with (9), to
5.1 ± 0.3 m/year during 1999 to 2010 and 2010 to 2014, respectively. On the ice shelf,
the change in ice thickness is large enough to be directly measured with radar (Fig. 3).
After correction for dynamic thinning and changes in surface mass balance (SMB), we find
that the ice-shelf thickness at the 1996 and 2011 grounding lines decreased by
161 ± 43 m and 100 ± 50 m, respectively, during 1999 to 2010 and 2010 to 2014,
reflecting enhanced bottom melting by the ocean of 14.6 ± 4.1 m/year and 25 ± 12 m/year
during those time periods (see the supplementary materials). Application of mass
conservation on the ice shelf indicated that in the 1990s, the steady-state bottom melt
rates of ZI and NG averaged 8 and 5 m/year, respectively, and reached 25 m/year within
10 km of the grounding lines (5). We conclude that ice-shelf bottom melting doubled in
recent years compared with the 1990s and that half of the increase took place between
2010 and 2014.
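
The bottom-melt rates quoted above are, to first order, the corrected thickness changes divided by the length of each period. The sketch below reproduces the central values under that assumption; it is a simple division, not the authors' error analysis (which is in their supplementary materials), and the function name is hypothetical.

```python
def melt_rate_m_per_year(thickness_loss_m, start_year, end_year):
    """Average melt rate from a corrected thickness loss over a period."""
    return thickness_loss_m / (end_year - start_year)


rate_1999_2010 = melt_rate_m_per_year(161.0, 1999, 2010)  # 161 m over 11 years
rate_2010_2014 = melt_rate_m_per_year(100.0, 2010, 2014)  # 100 m over 4 years

print(f"1999-2010: {rate_1999_2010:.1f} m/year")
print(f"2010-2014: {rate_2010_2014:.1f} m/year")
```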



On NG, 3.7 km upstream of the 1996 grounding 
line, ice thinned 0.9 ± 0.1 m/year and 1.4 ± 0.5 m/year 
during 1999 to 2012 and 2012 to 2014, respec- 
tively. The radar echograms show that 5 km 
downstream of the grounding line, the ice shelf 
lost 30% of its total thickness (fig. S5). This cor- 
responds to a bottom melting of 13.3 ± 4 m/year 
in the past 15 years, or 50% above the melt rate 
from the 1990s (5). The ice shelf is therefore 
eroding rapidly from the bottom. We hypothe- 
size that the erosion has not translated into an 
inland migration of the grounding line and ice- 
flow acceleration because the bed of NG rises 
inland and the ice-shelf front did not detach 
from bay walls, islands, and ice rises (fig. S5). 

Combining surface velocity and ice thickness, we calculate the glacier ice discharge from
1976 to 2015 (Fig. 4 and fig. S6). On ZI, the ice flux increased from 10.3 ± 1.2 Gt/year
in 1976 to 15.4 ± 1.7 Gt/year in 2015, or 50%. On NG, the ice discharge increased by 8%
from 1976 to 2015. Comparing the ice discharge with net accumulation of mass over the
drainage basins (fig. S7) using the regional climate model MAR (Modèle Atmosphérique
Régional) (10) indicates that ZI was in a state of mass balance until 2003 and is now
losing mass at about 5 Gt/year, whereas NG remains close to a state of mass balance
(Fig. 4). Our discharge estimates for ZI supersede the overestimates in (7) for the
period 1990 to 2012, which employed less reliable ice thickness data (figs. S8 and S9).
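
As a quick check of the figures above, the sketch below computes the relative increase in ZI ice flux and converts the present mass loss into an approximate sea-level contribution. The conversion factor of about 362 Gt of water per millimeter of global mean sea level is a commonly used approximation and is not taken from this report; the function names are hypothetical.

```python
GT_PER_MM_SEA_LEVEL = 361.8  # assumed: ~3.618e8 km^2 of ocean times 1 mm of water


def percent_increase(old, new):
    """Relative change from old to new, in percent."""
    return 100.0 * (new - old) / old


def sea_level_rate_mm_per_year(mass_loss_gt_per_year):
    """Approximate global mean sea-level contribution of a mass-loss rate."""
    return mass_loss_gt_per_year / GT_PER_MM_SEA_LEVEL


flux_change = percent_increase(10.3, 15.4)   # ZI ice flux, 1976 versus 2015
zi_sea_level = sea_level_rate_mm_per_year(5.0)  # ZI mass loss of about 5 Gt/year

print(f"ZI flux increase: {flux_change:.1f}%")
print(f"ZI contribution: {zi_sea_level:.3f} mm/year")
```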

The MAR reconstruction shows that the mean 
surface runoff tripled from 0.6 to 1.8 Gt/year, 
respectively, during 1960 to 1990 and 2002 to 
2014 (Fig. 4C) as a result of warmer air temper- 
atures. Higher melting thins ice from the top and 
contributes to grounding-line retreat as floating 
ice achieves hydrostatic equilibrium farther up- 
stream. Meltwater ponding on the ice shelf likely 
contributed to its break-up via hydrofracturing 
(11). Warmer air temperatures melted the ice 
melange that keeps ice floes glued together in the 
fjord (7). Enhanced glacier runoff increased sub- 
glacial freshwater discharge at the grounding line, 
which drives a stronger thermohaline circulation 
at the ice underside and increases the rate of melt 
by the ocean (12). 

Ocean in situ measurements over the period 
1997 to 2010 show an increase of +1°C in mean 
temperature of the warm, salty subtropical-origin 
Atlantic Water (AW) advected from the North 
Atlantic toward the Arctic Ocean via the West 
Spitsbergen Current (WSC) (13). Although resolv- 
ing the transport pathways and water-mass trans- 
formation of AW in the Northeast Atlantic and 
East Arctic Ocean is an area of active research 
(14, 15), it is known that some fraction of these 
warm northward-flowing waters recirculates in 
the northern Greenland Sea and in the southern 
Nansen Basin to join the southward-flowing East 
Greenland Current (EGC) (16). Ocean tempera- 
tures observed from moored instruments span- 
ning Fram Strait at 78°50′N from 1997 to the 
present (17) show that temperature anomalies in 
the northward-flowing WSC also appear in the 
southward-flowing EGC (18). Although high- 
resolution (2 to 4 km) ocean simulations show 
that ocean temperature anomalies on the EGC 
propagate from the continental shelf break into 
Belgica Trough to within 50 km of ZI (19), the 
seafloor bathymetry in these critical last 50 km is 
not known well enough to simulate ocean circu- 
lation close to the glacier. Yet observations from 
1996-1997 (6) and 2009 (20) reveal the presence 
of warm AW at the mouth of the NG ice-shelf 
cavity. A 1°C increase in AW would have increased 
bottom melting by 10 (21) to 20 m/year (22), 
which is within the range of our observations. 
We conclude that ocean warming most probably 
played a major role in triggering the glacier re- 
treat, more important than the sea-ice concen- 
tration decrease (7). Oceanographic observations 
near ZI are critically needed to address the effect 
of thermal ocean forcing on the glacier evolution 
in more detail. 

ZI has now transformed into a tidewater glacier calving along an ice cliff as a result of
warmer air and ocean temperatures. The mass loss is driven by the increase in ice
discharge rather than a change in SMB (Fig. 4). The glacier detached from a stabilizing
sill and retreated into a retrograde basin 700 m bsl. Tidewater glaciers are known to
retreat rapidly along retrograde beds until the bed rises again (23). We project that ZI
may continue retreating rapidly for another 20 to 30 years. Its ice front will
progressively widen from 19 km at present to 50 km about 30 km upstream, thereby
increasing ice discharge. The height of the calving cliff will increase from its current
75 m to enhance the risk of ice fracture (11). With the formation of a calving cliff, the
ocean-induced melt rates will increase considerably because buoyant meltwater plumes rise
faster along a vertical face than along a near-horizontal ice-shelf bottom (5, 12).
Beyond 30 km, the retreat will be slowed down by a rising bed topography, but submarine
channels will maintain the contact with the ocean into the deep interior.

Fig. 2. Ice speed of ZI and NG from 1976 to 2015. (A and B) Ice speed along the profile
in Fig. 1B color-coded from blue (1976) to red (2015). Thick vertical lines locate the
grounding lines. The inset displays ice speed versus time at the location of the black
dashed vertical line.

The ZI/NG sector is one of three major marine-based basins in Greenland (fig. S10), along
with Jakobshavn Isbrae (JI) and Petermann (PG)-Humboldt glaciers, each holding a 0.6-m
sea-level equivalent. JI started a rapid retreat (18 km from 2001 to 2015) after the
collapse of its ice shelf and has undergone massive calving events since 2010 (24)
(fig. S11). The central channel of the PG ice shelf lost 250 m of ice from 2002 to 2010,
and the ice front retreated 33 km from 2010 to 2012 (25). The NG ice shelf will become
vulnerable to break-up in the near future if thinning continues. These observations
combined suggest that all three major marine-based basins are undergoing substantial
changes at present. JI and ZI have already transitioned to a tidewater glacier regime,
with increased calf-ice production and ice melting by the ocean. The retreat of these
marine-based sectors is likely to increase sea-level rise from Greenland for decades to
come.

Fig. 3. Surface elevation and ice thickness of ZI from 1999 to 2014. (A to C) Radar
echograms along the center-line profile in Fig. 1A. Ice surface and bottom (assuming
flotation) from LiDAR data, respectively, are white. Bed elevation from radar data is
green. (D) Evolution of ZI between 1999 and 2014, with successive ice-front positions
color-coded from dark (1999) to light gray (2014), seawater in blue, and bedrock in light
brown. Vertical dashed lines locate the grounding lines.

Fig. 4. Yearly ice discharge, surface mass balance, and runoff of ZI and NG from 1976 to
2015. (A and B) Ice flux [D] and SMB from the regional climate model MAR (10) for ZI and
NG color-coded blue and orange, respectively. (C) Runoff from MAR (10). (D) Mass
evolution of NG, ZI, and both combined. SMB(1960-90) is the mean SMB for the time period
1960-1990.

NEURONAL DYNAMICS

High-speed recording of neural spikes 
in awake mice and flies with a 
fluorescent voltage sensor 

Yiyang Gong,1,2,3* Cheng Huang,1 Jin Zhong Li,1,2 Benjamin F. Grewe,1,2
Yanping Zhang,1,2,4 Stephan Eismann,1,2 Mark J. Schnitzer1,2,4*

Genetically encoded voltage indicators (GEVIs) are a promising technology for 
fluorescence readout of millisecond-scale neuronal dynamics. Previous GEVIs had 
insufficient signaling speed and dynamic range to resolve action potentials in live 
animals. We coupled fast voltage-sensing domains from a rhodopsin protein to bright 
fluorophores through resonance energy transfer. The resulting GEVIs are sufficiently bright 
and fast to report neuronal action potentials and membrane voltage dynamics in awake 
mice and flies, resolving fast spike trains with 0.2-millisecond timing precision at spike 
detection error rates orders of magnitude better than previous GEVIs. In vivo imaging 
revealed sensory-evoked responses, including somatic spiking, dendritic dynamics, 
and intracellular voltage propagation. These results empower in vivo optical studies 
of neuronal electrophysiology and coding and motivate further advancements in 
high-speed microscopy. 



To dissect the mechanisms of high-speed neuronal information processing in the live
brain, neuroscientists need to track cellular and subcellular electrophysiological
activity with millisecond-scale resolution in identified neuron types. Genetically
encoded fluorescent Ca²⁺ indicators report isolated, individual action potentials from
many cell types in live animals (1, 2). However, Ca²⁺ indicators’ slow kinetics (~50 to
1000 ms) precludes high-fidelity studies of fast-spiking cell types, determinations of
spike waveforms, resolution of individual spikes in fast spike trains, and precise
estimates of spike timing. Moreover, the magnitude of Ca²⁺ influx in response to an
action potential varies across cell types and even within individual cells (1, 2). In
vivo Ca²⁺ imaging also poorly tracks subthreshold or dendritic voltage dynamics, due to
insensitivity to hyperpolarizations and confounds from synaptic Ca²⁺ influx. Organic
voltage-sensitive dyes typically have much faster kinetics than Ca²⁺ indicators but are
generally highly phototoxic, allow neither genetically targeted delivery nor long-term
imaging studies of single cells, and have been incapable of reporting single spikes in
the live mammalian brain (3).

GEVIs combine genetic targeting and optical readout of transmembrane voltage (3, 4), and
in principle can sense spikes and subthreshold dynamics. Nevertheless, to date, GEVIs
have lacked the capabilities to detect individual action potentials and fast spike trains
in live animals (3, 4). Past efforts fused fluorescent proteins to voltage-sensitive
domains (VSDs) from voltage-sensitive phosphatases (5-9) or used Archaerhodopsin (Arch),
which is both a fast VSD and a dim fluorophore (10). Although Arch variants work well in
cultured neurons, the intense illumination required (1 to 10 W·mm⁻²) plus the consequent
heating, autofluorescence, and photodamage have precluded imaging studies in intact
tissue over wide fields of view (10).

Here, we present fast GEVIs (<1-ms response) that fuse the Acetabularia acetabulum
rhodopsin (Ace) (11) and mNeonGreen (12) fluorescent protein to enable voltage-sensitive
fluorescence resonance energy transfer (FRET) (Fig. 1A and table S1). We previously
introduced this “FRET-opsin” configuration (13, 14), which combines the fast kinetics of
a rhodopsin VSD with a bright fluorophore and provides high-fidelity membrane potential
and spike train readouts at illumination levels ~50 to 100 times lower than those used
with Arch indicators. A FRET-opsin indicator based on Leptosphaeria maculans (Mac)
rhodopsin and yellow fluorescent mCitrine reported fast neural spiking in brain slices
and Purkinje neurons’ dendritic activation in live mice (13). These results had suggested
that optical recordings of action potentials and dendritic voltage dynamics in live
animals might be attainable. Ace-mNeon indicators now enable high-fidelity imaging of
individual spikes and fast spike trains in live mice and flies due to their faster
kinetics and superior brightness compared with all prior GEVIs. Ace is about six times as
fast as Mac, and mNeonGreen has a ~50% higher extinction coefficient than mCitrine and
nearly threefold better photostability (12). We created Ace mutants (Ace1Q and Ace2N)
with an inactivated proton pump; these have blue-shifted absorption spectra compared with
Mac and Arch (11, 13), yielding superior FRET acceptors when paired with green or yellow
emitters (figs. S1 and S2). When used together with protein trafficking signals, the
fusions provide high FRET efficiency and minimal protein aggregation in live neurons
(Fig. 1, A and B), key attributes of a FRET indicator (13, 14).

We measured responses of Ace1Q-mNeon and Ace2N-mNeon to voltage depolarization steps in
cultured human embryonic kidney-293T (HEK293T) cells. These sensors responded five to six
times as fast as MacQ-mCitrine (13) and the ASAP1 indicator (7) (Fig. 1C and table S2).
At termination of a step depolarization, Ace2N-mNeon exhibited some hysteresis during its
relaxation back to baseline fluorescence levels (Fig. 1C, inset, and fig. S3). However,
the steady-state response varied linearly with the transmembrane voltage within the
physiological range of subthreshold potentials, and the initial peak response increased
linearly between -100 mV and +30 mV (Fig. 1D). These relationships were similar to those
of previous FRET-opsin sensors (Fig. 1D and fig. S3), but the faster kinetics of the Ace
indicators enabled superior spike detection fidelity, even for rapid successions of
spikes that would have blurred together with Ca²⁺ imaging (Fig. 2, A to E). During action
potentials in cultured neurons, Ace2N-mNeon and Ace1Q-mNeon had peak changes in
fluorescence intensity relative to baseline values (ΔF/F) of 12.0 ± 0.8% (mean ± SEM;
N = 12 cells) and 8.5 ± 0.7% (N = 10 cells), respectively, while leaving the spike
waveform unperturbed (Fig. 2, A and B, and fig. S4). Photobleaching rates were those of
the constituent mNeonGreen, which is as photostable as enhanced green fluorescent protein
(12) and comparable to those of other brightly fluorescent voltage indicators (table S2).

To quantitatively compare multiple indicators, we used a metric of spike detection
fidelity, d′, from signal detection theory (9, 15). This metric accounts for an
indicator’s optical waveform in response to a spike and thus captures rates of correct
and false spike detection better than other metrics, such as signal-to-noise ratio (SNR)
(16). Modest improvements in d′ can have huge benefits for experimentation because false
detection rates decline faster than exponentially with increases in d′ (9, 15). Compared
to MacQ-mCitrine, the superior brightness and peak ΔF/F responses of the Ace indicators
increased d′ by factors of 2.2 to 3.0, with d′ values of 100 ± 12 (Ace2N-mNeon, mean ±
SEM) and 70 ± 8 (Ace1Q-mNeon) in cultured neurons (Fig. 2C). Our calculations predicted
that this should be sufficient to image spikes >0.4 ms in duration, not only in brain
slices but also in live animals (fig. S5).
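
The steep dependence of false detection rates on d′ can be illustrated with a textbook model. The sketch below assumes an equal-variance Gaussian detector with the decision threshold placed midway between the noise and spike distributions, so that the per-frame false-positive probability is the Gaussian tail beyond d′/2. This is a simplification for illustration, not the waveform-based d′ calculation used in the study, and the function names are hypothetical.

```python
import math

SECONDS_PER_DAY = 86400.0


def false_positive_prob(d_prime):
    """Per-frame false-positive probability, Q(d'/2), for a midpoint threshold."""
    return 0.5 * math.erfc(d_prime / (2.0 * math.sqrt(2.0)))


def false_positives_per_day(d_prime, frame_rate_hz):
    """Expected number of false positives per day of continuous imaging."""
    return false_positive_prob(d_prime) * frame_rate_hz * SECONDS_PER_DAY


if __name__ == "__main__":
    for d_prime in (4, 8, 12, 16):
        rate = false_positives_per_day(d_prime, 1000.0)  # 1-kHz frame rate
        print(f"d' = {d_prime:2d}: {rate:.3g} false positives per day")
```

In this simplified model, each fixed increment in d′ reduces the error probability by a larger factor than the previous increment, and a d′ of 16 at a 1-kHz frame rate yields fewer than 10⁻⁷ false positives per day.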

Studies in mouse brain slices and live mice validated these predictions (Fig. 2, B to E,
and Fig. 3, A to F). To facilitate spike detection, we sparsely expressed the GEVIs using
the SAD-ΔG viral vector, which efficiently transduces neurons through axons (17). This
labeling approach reduced background fluorescence from out-of-focal-plane or
nonspecifically labeled sources. Previous studies with this virus have shown that there
is minimal toxicity in neurons up to 10 days after infection (17). We imaged thick brain
slices 3 to 5 days after viral injection and found that Ace2N-mNeon reported action
potentials with ΔF/F = 6.5 ± 1.5% and d′ = 32 ± 5 (mean ± SEM; N = 5 cells) (Fig. 2B).
Neurons expressing the GEVI had statistically indistinguishable spike waveforms compared
with nonfluorescent neurons in the same slices (fig. S4).

We next imaged Ace2N-expressing neurons in the mouse visual cortex, using an indicator
variant (Ace2N-4AA-mNeon) with four extra amino acids in the linker domain (table S1).
This variant yielded improved expression and superior visualization of individual cells’
dynamics in live mice (Figs. 2D and 3A, inset). Because extending the linker left the VSD
unchanged, the voltage-dependent kinetics were unaffected, although the increased
distance between the FRET pairs reduced the peak ΔF/F response to neural spikes (fig. S6
and table S2). In anesthetized mice (Fig. 2, D and E) and awake mice (Fig. 3), we readily
observed spiking waveforms in labeled neurons in cortical layer 2/3, with interspike
intervals as brief as ~10 ms, i.e., an interval briefer by a factor of 20 than the signal
integration time of the fast Ca²⁺ indicator GCaMP6f (1).

Ace2N-4AA-mNeon also provided a high level 
of spike detection fidelity (d' = 16 ± 1.5; mean ± 
SEM; N = 56 layer 2/3 neurons, ~150 μm below
the brain surface; N = 20 awake mice), at which
the frequency of spike detection errors is vastly
less than with previous GEVIs. Imaging at a 1-kHz
frame rate at a d' value of 16 corresponds to a
mean rate of <10⁻⁷ detection errors per day (15).
Consistent with this, during simultaneous op- 
tical and loose-patch electrical recordings in 
the same neocortical neurons of anesthetized 
mice (Fig. 2D), the spike trains attained using 

Fig. 1. Ace FRET-opsin sensors report membrane voltage with ~1-ms
response times. (A) Linker sequences bridging Ace1Q and Ace2N to mNeon-
Green. Endoplasmic reticulum (ER) export sequence and Golgi export trafficking
signal (TS) at the construct’s C terminus improve the sensor’s membrane
localization and hence the signaling dynamic range. (B) Fluorescence signals from
neurons expressing Ace1Q-mNeon or Ace2N-mNeon. (Left) Baseline fluorescence
emissions from mNeonGreen. (Right) Spatial maps of the fluorescence response
(ΔF/F) to a voltage step of approximately 100 mV. Areas of fluorescence and
voltage response were generally colocalized. Scale bar, 20 μm. Illumination in-
tensity, 15 mW · mm⁻². (C) Step responses of the Ace sensors, ASAP1, and MacQ-
mCitrine to +100 mV command voltage steps, normalized to each sensor’s
maximum (or steady state) ΔF/F response to the command voltage. The initial
rise of the Ace2N-mNeon sensor was more than six times as fast as that of ASAP1
and MacQ-mCitrine (table S2). (Inset) The full step response of all sensors. The trace for Ace2N-mNeon exhibits hysteresis ~40 to 200 ms after the voltage step,
outside the interval shown in the main plot. Illumination intensity, 15 to 50 mW · mm⁻². Data acquisition rate, 5 kHz. Inset traces were down-sampled to 250 Hz.
(D) Steady-state responses of FRET-opsin sensors as a function of membrane voltage in cultured neurons (N = 10 cells per trace). Because Ace2N-mNeon
exhibited hysteretic responses to voltage steps (fig. S2), we also plot its peak initial responses. Illumination intensity, 15 mW · mm⁻². Error bars, SEM.

Ace2N-4AA-mNeon and those from the electri-
cal recordings were in perfect concordance (837
total spikes detected in dual recordings; N =
3 mice). Highlighting the importance of Ace2N’s
~threefold increase in d' over the best previous
GEVIs, in vivo imaging with a value threefold 
lower (d' = 5.3) leads to an unacceptable four 
spike-detection errors per second. The nearly 
vanishing error rate predicted at d' = 16 is un- 
attainable over hours under true experimental 
conditions due to the indicator’s finite photo- 
stability and nonstationary noise sources in the 
live brain other than photon shot noise. Never- 
theless, this calculation underscores the huge 
effect of modest rises in d', in that Ace2N has an
error rate that is orders of magnitude lower than 
previous GEVIs and is the first GEVI with accept- 
able error rates in the detection of single action 
potentials in live mammals. 
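
The error rates quoted above can be approximated with a standard signal-detection argument. The sketch below is not taken from the paper or from its reference 15. It assumes equal-variance Gaussian noise, a detection threshold placed midway between the noise and spike distributions (at d'/2), and one independent detection decision per camera frame. The function names are illustrative.

```python
import math

SECONDS_PER_DAY = 86_400


def gaussian_tail(x: float) -> float:
    """Upper-tail probability Q(x) of a standard normal variable."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def false_positive_rate(d_prime: float, frame_rate_hz: float) -> float:
    """Expected false detections per second with the threshold at d'/2.

    Assumption: each frame is an independent test, so the rate is the
    frame rate times the per-frame false-positive probability.
    """
    return frame_rate_hz * gaussian_tail(d_prime / 2.0)


for d_prime in (5.3, 16.0):
    per_second = false_positive_rate(d_prime, frame_rate_hz=1000.0)
    per_day = per_second * SECONDS_PER_DAY
    print(f"d' = {d_prime:4.1f}: {per_second:.2e} errors/s, {per_day:.2e} errors/day")
```

Under these assumptions, d' = 5.3 at a 1-kHz frame rate gives roughly four false detections per second, and d' = 16 gives fewer than 10⁻⁷ per day, in line with the figures in the text.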

Ace2N-4AA-mNeon also provided exquisite 
spike-timing accuracy, as determined from the 
dual optical and electrical recordings in live mice 
(Fig. 2E). Relative to the spike times in the 
electrical traces, the spike times in the optical
traces differed by only 0.24 ± 0.01 ms (SEM; N =
837 spikes) (16). This degree of spike-timing ac-
curacy approached the theoretical physical lim-
its, 0.14 ± 0.03 ms, set by quantum mechanical
photon shot noise (9, 15, 16).
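
As a rough illustration of how such a timing accuracy can be computed from paired recordings, the sketch below matches each electrically recorded spike to the nearest optically detected spike and summarizes the differences by their standard deviation, which is how the Fig. 4C caption reports timing errors for the fly recordings. The spike times are made up, and the matching rule and the `max_lag_ms` cutoff are assumptions rather than the authors' procedure.

```python
import statistics


def timing_errors(optical_ms, electrical_ms, max_lag_ms=2.0):
    """Signed optical-minus-electrical time (ms) for each matched spike.

    Each electrical spike is paired with the nearest optical spike;
    pairs further apart than max_lag_ms are treated as unmatched.
    """
    errors = []
    for t_elec in electrical_ms:
        nearest = min(optical_ms, key=lambda t_opt: abs(t_opt - t_elec))
        if abs(nearest - t_elec) <= max_lag_ms:
            errors.append(nearest - t_elec)
    return errors


# Made-up spike times in milliseconds, for illustration only.
electrical_ms = [10.0, 35.0, 52.0, 90.0]
optical_ms = [10.1, 34.8, 52.3, 89.9]

errors = timing_errors(optical_ms, electrical_ms)
spread_ms = statistics.stdev(errors)
print(f"{len(errors)} matched spikes, timing-error SD = {spread_ms:.2f} ms")
```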

We also examined in awake mice whether 
individual cells’ ongoing fluctuations in baseline 
Ace2N-4AA-mNeon emissions reflect subthreshold 
membrane voltage dynamics. If so, periods of 
optically reported membrane hyperpolarization 
should have reduced spike rates, and periods of 
reported depolarization should have elevated 
firing rates. This prediction proved correct for all 
56 layer 2/3 neurons imaged in awake mice 
(Fig. 3B). As determined optically, each neuron 
fired spikes from baseline voltage levels that 
were significantly higher than its median rest- 
ing potential (P < 0.05 for all 56 cells; Wilcoxon 
rank-sum test). As a population, the 56 cells 
were far more likely to fire spikes when the 
optically reported membrane voltages were above 
median resting levels (P < 10⁻¹⁵; binomial test).
Ace2N-4AA-mNeon also revealed submillisecond-
scale features of dendritic voltage dynamics during
spike generation (Fig. 3C and movie S1). Thus,
Ace2N reports aspects of individual cells’ sub- 
threshold, dendritic, and high-speed membrane 
potential dynamics in the live brain. 
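
The Fig. 3B caption describes the analysis behind this result: a 50-ms median filter removes the spikes from each trace, and each spike is then assigned the percentile of the filtered voltage at which it occurred. A minimal sketch of that procedure follows. It assumes a 1-kHz trace that has already been sign-corrected so that larger values mean depolarization, uses a 51-frame window as a stand-in for 50 ms, and runs on synthetic data. The function names are hypothetical.

```python
import statistics
from bisect import bisect_left


def median_filter(trace, window):
    """Sliding median; the window is truncated at the ends of the trace."""
    half = window // 2
    return [
        statistics.median(trace[max(0, i - half): i + half + 1])
        for i in range(len(trace))
    ]


def spike_voltage_percentiles(trace, spike_frames, window=51):
    """Percentile (0 to 100) of the spike-free baseline at each spike frame."""
    baseline = median_filter(trace, window)
    ranked = sorted(baseline)
    last = len(ranked) - 1
    return [100.0 * bisect_left(ranked, baseline[f]) / last for f in spike_frames]


# Synthetic 1-s trace at 1 kHz (made-up data): a slow depolarizing ramp
# with two brief spikes riding on its upper part.
trace = [i / 1000.0 for i in range(1000)]
spike_frames = [800, 900]
for f in spike_frames:
    trace[f] += 5.0

percentiles = spike_voltage_percentiles(trace, spike_frames)
print(percentiles)
```

Binning such percentiles across many spikes, and dividing by the time spent at each percentile, would give a spike rate as a function of baseline voltage like the one plotted in Fig. 3B.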

We next studied visually evoked spiking re- 
sponses. To illustrate targeting of specific sub- 
classes of visual cortical neurons, we expressed 
Ace2N-4AA-mNeon selectively in cells of the pri-
mary visual area (V1) that sent axons to the latero-
medial cortical area (LM) (denoted V1→LM
neurons), one of 15 cortical areas in mice that re-
ceive inputs from V1 (16, 18). We presented mov-
ing grating visual stimuli to awake mice and
imaged the cells’ evoked responses. Generally,
layer 2/3 V1→LM neurons’ evoked spiking was
preferentially elicited by gratings of one orienta-
tion (4.1 ± 1.1 spikes · s⁻¹; mean ± SEM) versus the
response (or suppression of activity) to gratings
of the orthogonal orientation (1.0 ± 0.2 spikes · s⁻¹)
(Fig. 3, D to F) (N = 7 neurons from 5 mice; P <
10⁻³; permutation test for each cell; 10⁵ permu-
tations). The mean orientation selectivity index
(0.82 ± 0.03; mean ± SEM), as determined from
Ace2N-4AA-mNeon fluorescence traces, fit well



[Fig. 2 graphic. Labels recoverable from the panels: ΔF/F and voltage scale bars; time scale bars (0.2 s, 20 ms); sensor legend (Ace2N-mNeon, Ace1Q-mNeon, MacQ-mCitrine, Arclight, ASAP1); d' isocontours; axes "Timing error (ms)" and "Photons in transient".]



Fig. 2. Ace sensors provide about threefold to tenfold better spike detec-
tion fidelity (d') than previous GEVIs. (A) Fluorescence signals from cultured
neurons expressing Ace2N-mNeon (blue trace) had sharp peaks closely match-
ing action potentials in concurrent electrophysiological recordings (black trace).
Red arrows mark a ~5 mV depolarization apparent in both traces. (B) Optical and
electrical waveforms of single action potentials in example neurons expressing
Ace2N-mNeon in cell culture (top), Ace2N-mNeon in mouse brain slice (middle),
and Ace2N-4AA-mNeon in anesthetized mouse brain (bottom). Data (gray
points) acquired optically from different spikes were temporally aligned to the corresponding peaks in the electrical traces. Mean waveforms of the optical
data (blue traces; averaged over N = 30 spikes; resampled to 2 kHz) are aligned with the mean electrical waveforms (black traces; whole-cell patch recordings
for cells in culture and brain slice; loose patch recording for live mouse). (C) Peak ΔF/F responses to action potentials, as a function of the total number of
photons detected per spike in cultured neurons. Isocontours (dashed lines) of spike-detection fidelity, d', were determined from measured brightness and
optical waveforms, as in (B). Error bars, SEM. (D) Concurrent optical (colored) and juxtacellular electrical (black) recordings in an anesthetized mouse from V1
cells expressing Ace2N-4AA-mNeon. Magenta trace (lower right) shows the two magenta spikes in adjacent trace. Across 837 spikes, the electrical and optical
traces were in perfect accord. (Inset) Two different neocortical neurons imaged by two-photon microscopy. A maximum projection of a dual-color image stack
(top) acquired in a live mouse shows an Ace2N-4AA-mNeon-labeled cell and a pipette, filled with red dye, that recorded somatic electrical activity. An image
acquired in a brain slice (bottom) after in vivo experimentation shows the membrane localization of Ace2N-4AA-mNeon. Scale bars, 40 μm. (E) Histogram of
timing errors for spikes detected optically as in (D), using the electrical trace to provide the actual spike time (N = 837 spikes from three cells). Red line, Gaussian
fit. Error bars, SD, estimated as counting errors. Illumination: 15, 25, and 25 mW · mm⁻², respectively, for studies in culture, brain slice, and live mice. Image
acquisition rates: 440 Hz, 440 Hz, and 1000 Hz, respectively.

with values determined by Ca²⁺ imaging in the
same neurons (Fig. 3E) and with previous studies
(16, 19). There were also light-sensitive V1→LM
neurons that responded at the onset or offset of
visual illumination regardless of the grating orien-
tation (fig. S8).

We also studied sensory-driven neural dynam- 
ics in live fruit flies, in which electrical recordings 
are often challenging. Beyond in vivo imaging of
fast somatic spike trains, GEVIs have the poten-
tial to reveal fast voltage dynamics in genetically
identified dendrites and axons. To illustrate, we
imaged fast dynamics in the olfactory system and 
first examined local neurons (LNs) of the olfactory 
glomeruli (20), in which we used the R55D11- 
GAL4 fly driver line (21) to express an Ace2N 
variant with two extra amino acids in the linker 
domain (Ace2N-2AA-mNeon) (Fig. 4A, fig. S6,
and table S2). The baseline spike rate in the
absence of odor stimulation was 7.4 ± 2.8 s⁻¹ (N =
8 neurons from 8 flies), comparable to electro-
physiological measurements (20). In response to
odor stimuli, we observed fast, evoked spike
trains in single trials at firing rates significantly
above baseline (P < 10⁻³; Wilcoxon signed-rank
test) (Fig. 4B). The peak odor-evoked spike rate
was 65 ± 7 s⁻¹; i.e., the individual spikes fired

Fig. 3. Imaging single action potentials and subthreshold membrane
voltage fluctuations in layer 2/3 visual cortical neurons of awake mice.
(A) Optical voltage trace acquired in an awake mouse, showing spiking by a
V1 layer 2/3 neuron expressing Ace2N-4AA-mNeon via the SAD-Ace2N-4AA-
mNeon-RGECO1 virus. Dashed boxes indicate intervals shown at successively
expanded time scales. (Insets) Image acquired in vivo of the cell that pro-
vided the optical trace and a pair of fluorescence images taken in a fixed
tissue slice of an identically labeled V1 neuron from the same preparation
expressing Ace2N-4AA-mNeon (green) and RGECO1 (red). Scale bar, 40 μm.
(B) Periods of reduced spiking arose when Ace2N-4AA-mNeon reported a
hyperpolarization. To track each cell’s membrane voltage apart from its
spikes, we applied a median filter (50-ms window) to the trace. For non-
parametric comparisons, we matched each spike to the voltage at which the
spike occurred, quantified as a percentile of the cell’s full range of mem-
brane voltages over the full recording. For each cell, the plot shows the spike
rate (normalized to each cell’s peak rate) at voltages across all percentiles.
(C) (Top) Spike-triggered average image frames, for the cell in (A), showing
the mean dendritic activation before and after firing an action potential.
Times are relative to the spike peak at the soma. The left two dendrites
activated before the right two dendrites. We calculated mean time traces for
the soma and the left and right dendrite pairs using the spatial masks
shown. (Bottom) Mean fluorescence time courses (ΔF/F) for each of the
three masks, normalized to the same maximum. The traces confirm the left-
to-right activation pattern; the right dendrites exhibit a voltage peak 0.25 to
0.5 ms after the left dendrites. The ΔF/F image series and time traces were
sampled in 0.25-ms time bins, using data from 1900 spikes mutually
aligned to their peaks. Shaded regions on the time traces denote SEM and
are barely discernible. (D) Example optical traces from a cortical V1→LM
neuron in an awake mouse, showing visually evoked responses to moving
gratings (orientations and motion directions marked above each trace).
Spiking responses to the cell’s preferred grating orientation (second col-
umn) differed from the activity suppression in response to the orthogonal
orientation (fourth column). (E) Mean spike rates of the cell in (D), de-
termined from the optical voltage trace in response to gratings moving in
different orientations (10 trials per stimulus). Solid black and dashed gray
circles indicate spike rates of 4 s⁻¹ and 2 s⁻¹, respectively. Ca²⁺ imaging
using RGECO1 in the same cell yielded similar orientation tuning. Evoked
Ca²⁺ signals were integrated over 1 s and normalized so that Ca²⁺ and
spiking responses to the preferred orientation (225°) were plotted at the
same radius on the polar plot. Shaded areas indicate SEM. (F) Mean ± SEM
spike rates in response to gratings at the preferred orientation were higher
than to those oriented orthogonally (P < 0.01 for each of seven cells;
permutation test; 10⁵ permutations). Frame acquisition rate: 1 kHz for
voltage imaging and 20 Hz for Ca²⁺ imaging. Illumination intensity: 20 and
10 mW · mm⁻², respectively, for voltage and Ca²⁺ imaging. Labeled neurons
were ~150 μm below the brain surface.

about an order of magnitude faster than Ca²⁺
imaging can resolve.

As in mice, we performed dual optical and
electrical recordings to verify the spike trains,
this time using whole-cell patch-clamp electro-
des to access LN neurons’ intracellular potentials
in intact fly brain explants (Fig. 4C, left). Across
18,141 recorded spikes, the spike trains provided
by Ace2N-2AA-mNeon perfectly matched those
from the patch-clamp recordings (N = 4 fly brains)
and had spike-timing errors of 0.19 ± 0.002 ms
(SEM; N = 18,141 spikes), close to the theoretical

Fig. 4. Odor-evoked spiking, dendritic dynamics,
and voltage propagation delays in fly olfactory neu-
rons. (A) (Top) Fluorescence images of a local neuron
(left) and a projection neuron (right) expressing Ace2N-
2AA-mNeon in the fly antennal lobe. (Bottom) Spatial maps of the fluo-
rescence response (ΔF/F) at the peak of an odor-evoked transient. Scale
bar, 40 μm. (B) (Top) Optical voltage traces (blue traces) from the local
neuron in (A) reveal increased spiking during 5% benzaldehyde odor pre-
sentation (purple bar). (Bottom) Odor-evoked spike rate as a function of
time, averaged over five trials (shaded area denotes SEM). (Inset) Mean
peak spike rate during odor presentation was higher than baseline rates
without odor [P < 10⁻³; Wilcoxon signed-rank test; N = 8 neurons and 8 flies
(20×UAS-Ace2N-2AA-mNeon/+; R55D11-GAL4/+)]. (C) (Left) Concurrent
optical voltage and whole-cell patch electrical recordings in whole brain
explants, from the same cell type as in (B). Spikes were evoked by current
injection. (Middle) Paired optical and electrical traces, taken from periods
enclosed by dashed boxes in the left panel. Spikes are clearly distinguished in
the optical traces from both the plateau of subthreshold depolarization and
each rise to spike threshold. (Right) Histogram of timing errors for the spikes
detected optically, relative to the spike times in the whole-cell patch recordings
(N = 18,141 spikes from 4 flies). Red line is a Gaussian fit. Standard deviation
of the timing errors was 0.19 ± 0.002 ms. Error bars estimating the SD as
counting errors are too small to be seen. (D) (Left) Spike-triggered average
image frames for the cell in (C), showing the mean activation in the neural
processes before and after firing a somatic spike, as determined from spike
times in the electrical recording. Depolarization started at about -1.0 ms in
the soma (located in the upper right of each image) and propagated right to
left across the dendritic tree during a spike. We calculated mean time traces
for the soma and two subportions of the dendrites using the spatial masks in
the lower right panel (cyan line encloses the soma; red and gold lines enclose
dendritic regions). Scale bar, 40 μm. (Right) ΔF/F for each of the three spatial
masks, normalized to the same maximum. The traces confirm the right-to-
left activation pattern; the left dendrites exhibited a voltage peak 0.5 to 0.7 ms
after the right dendrites. Mean ΔF/F image series and traces were sampled
in 0.25-ms bins, using data from 1300 spikes, temporally aligned to the spike
peaks. Shaded regions denote SEM and are only discernible for the soma.
(E) Odor-evoked, optical voltage traces from the DL3 projection neuron den-
drites of (A), with significant rises in activity over baseline for some odors [10%
ethyl acetate (EA) and 3-octanol (1030); P < 0.04 for both EA and 1030]
but not others [10% benzaldehyde (BEN) and isopropanol (IPA); P =
1 and 0.4 for BEN and IPA, respectively; Wilcoxon signed-rank tests; N = 6 flies
(20×UAS-Ace2N-2AA-mNeon/+; R26B04-GAL4/+)]. (F) Odor-evoked den-
dritic activation levels, relative to baseline levels. (G) (Top) Example fly
expressing Ace2N-2AA-mNeon in DL3 cells in which we concurrently im-
aged dendrites and axons. (Bottom) Optical traces from both regions show
concurrent activity. (Inset) Averaged traces aligned to onset of dendritic
activity reveal a ~4-ms propagation delay between dendrites and axons (N =
30 transients; shaded regions denote SEM). Scale bar, 50 μm. Illumina-
tion, 20 mW · mm⁻². Frame acquisition rate, 1 kHz. For display only, each
optical trace in (B) was high-pass filtered by subtracting a median-filtered
(50-ms window) version of the trace. Traces in (E) were processed the
same way for display, but with a 1-s filter window.

optimum of 0.11 ± 0.03 ms (Fig. 4C, right).
Subthreshold dynamics were readily apparent in
the optical traces, including plateau potentials
and the rising and falling voltage waveforms
surrounding action potentials. Ace2N-2AA-mNeon
also reported the submillisecond-scale dynamics
of spike back-propagation into the dendritic tree,
revealing ~0.5- to 1.0-ms delays between the ini-
tiating voltage peak at the soma and those in the
dendrites (Fig. 4D and movie S2). 

We next imaged olfactory projection neurons 
(PNs), which generally have more selective odor 
tuning than LNs (22, 23). We used live flies and 
focused on R26B04-GAL4-labeled PN dendritic 
arbors (Fig. 4A), which exclusively innervate 
the DL3 glomerulus (21). In response to specif- 
ic odors, the arbors exhibited fast, odor-evoked 
voltage dynamics with temporal structure at the 
10- to 100-ms time scale (Fig. 4, E and F). We 
compared these dynamics to those seen by Ca²⁺
imaging using the GCaMP6f indicator (fig. S9). 
The odor-tuning profiles determined from the 
two modalities were in broad agreement, but 
voltage imaging with Ace2N revealed both 
small-amplitude responses and fast temporal 
signals that GCaMP6f failed to convey (fig. S9). 
Finally, we imaged the PN dendritic arbors and 
axonal boutons simultaneously. Odor-evoked vol- 
tage signals in the two structures were similar, 
but there was a 3.7 ± 0.4 ms (mean ± SEM; N = 
3 flies) propagation delay from dendrites to axons 
(Fig. 4G). Neither Ca²⁺ imaging nor electrical re-
cordings have been capable of revealing this type
of fast, intracellular signal propagation in a live 
animal. 

By virtue of their bright emissions, fast (<1 ms) 
kinetics, and targeted expression, Ace2N GEVIs 
open a wide range of experimental avenues in 
optical neurophysiology. The capabilities to track 
fast-spiking, subthreshold, and both sensory- 
evoked and back-propagating dendritic voltage 
activity in vivo allow direct visualization of many 
temporal features of neural dynamics, plasticity 
induction, and coding that have long been in- 
accessible. When combined with chronic animal 
preparations (9), it will be possible to track these 
phenomena over multiple days. Ace2N sensors 
resolved sensory-evoked spike trains with firing 
rates as high as ~75 s⁻¹ (the fastest we encountered)
with ~0.2-ms timing accuracy. Whereas in vivo
Ca²⁺ imaging usually resolves successive spikes
if they are ~50 to 250 ms apart and has ~20- to
100-ms timing accuracy for isolated spikes (1, 2), 
Ace2N sensors directly reveal the constituent 
spikes in spike bursts, a widespread and impor- 
tant means of neural signaling (24, 25). The Ace2N 
sensors’ timing precision is also key to resolving 
long-standing debates over the temporal attributes 
of neural coding (26, 27). 

The retrograde labeling strategy we used to 
express the Ace2N indicator in live mice also 
holds importance for future work, in that every 
soma we imaged had an axon at the virus in- 
jection site. This allowed us to target the subclass 
of V1 neurons with projections to area LM. A
logical next step in the study of V1 would be
systematic voltage imaging of visually evoked
spiking across all 15 subclasses of V1 neurons, as
categorized by their axonal projections (18). 
There are long-standing in vivo electrophysio- 
logical methods for identifying cells with specific 
axonal projections (28), but they are technically 
onerous to an extent that has precluded wide- 
spread and routine use. Voltage imaging is now 
poised to make commonplace the study of neural 
dynamics in live mammals in chosen anatomical 
projection pathways. 

Until now, subthreshold neural voltage dy- 
namics have usually been inaccessible in live ani- 
mals, especially in fine neural processes in which 
intracellular recordings are nearly prohibitive. 
As shown here, Ace-mNeon reveals fast dendritic 
activity, including stimulus-evoked dynamics 
and spike back-propagation. This is especially 
important for studies of flies and other species 
in which fine neurites are vital to information 
processing (29). There is also rising apprecia- 
tion of the importance of dendritic computation 
in mammalian neurons (30, 31). Ace2N should 
allow direct observations of such phenomena 
in vivo. 

For our in vivo imaging studies, we used wide- 
field epifluorescence microscopy, 1-kHz imaging 
rates, and sparse fluorescence labeling. Fluores-
cent V1→LM neural somata were usually ~150 μm
below the brain surface and separated by ~40 to
60 μm (fig. S7). This separation enabled high-
fidelity tracking of neural spiking despite the
lack of optical sectioning and the consequent 
background fluorescence. Comparable perform- 
ance should also be feasible in deep brain areas 
by using microendoscopes to access deep tissues 
(9). Studies of both surface and deep neural en- 
sembles would benefit from improved scientific- 
grade cameras for fast imaging (1 kHz) at cellular 
resolution over broader fields of view than are 
possible today. For comparison, Ca²⁺ imaging
studies (10 to 20 Hz) capture specimen areas 
larger by a factor of 100 than those sampled 
here at 1 kHz (9). 

In principle, microscopy modalities with optical 
sectioning should also allow denser fluorescence 
labeling patterns and deeper optical penetration 
through thick tissue. A challenge, however, is that 
nearly all extant in vivo microscopy modalities 
with sectioning involve laser scanning, with pixel 
dwell times typically ~0.1 to 2 μs for frame rates of
~10 to 20 Hz. Given the importance of detecting
as many signal photons as possible with voltage
imaging, conventional laser scanning is plainly
insufficient at the faster frame rates used for in
vivo voltage imaging (~0.5 to 1 kHz). The intro-
duction of Ace2N indicators for in vivo voltage
imaging puts a premium on the innovation of 
microscopes that provide optical sectioning with- 
out conventional laser scanning. The emergence 
of scanless forms of optical sectioning microscopy 
holds particular interest for the further develop- 
ment of in vivo voltage imaging capabilities 
(32, 33). Even with existing optical hardware, 
Ace-mNeon’s capacities for targeted imaging of 
cells of identified types or axonal projections, res- 
olution of fast spike trains, accurate spike-timing 
estimation, and observation of subthreshold and
dendritic voltage dynamics open many questions
to empirical study that hitherto could not be 
examined in live animals. 
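
To make the laser-scanning constraint discussed above concrete, the following back-of-the-envelope sketch computes the time a point-scanning microscope can spend on each pixel. The 512 × 512 frame size is an assumption chosen for illustration and is not specified in the text; scan overhead such as mirror flyback is ignored.

```python
def pixel_dwell_time_s(frame_rate_hz: float, pixels_per_frame: int) -> float:
    """Upper bound on per-pixel dwell time for a single scanned focus."""
    return 1.0 / (frame_rate_hz * pixels_per_frame)


PIXELS_PER_FRAME = 512 * 512  # assumed frame size, not taken from the paper

for frame_rate_hz in (15.0, 1000.0):
    dwell_ns = pixel_dwell_time_s(frame_rate_hz, PIXELS_PER_FRAME) * 1e9
    print(f"{frame_rate_hz:6.0f} Hz frame rate: {dwell_ns:7.1f} ns per pixel")
```

With this assumed frame size, a 15-Hz scan leaves about 0.25 μs per pixel, inside the ~0.1 to 2 μs range quoted in the text, whereas a 1-kHz scan leaves only a few nanoseconds, which leaves little time to collect signal photons at each pixel.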

Wrapping it up in a person:
Examining employment and earnings
outcomes for Ph.D. recipients

In evaluating research investments, it is important to establish whether the expertise gained
by researchers in conducting their projects propagates into the broader economy. For eight 
universities, it was possible to combine data from the UMETRICS project, which provided 
administrative records on graduate students supported by funded research, with data from 
the U.S. Census Bureau. The analysis covers 2010-2012 earnings and placement outcomes of 
people receiving doctorates in 2009-2011. Almost 40% of supported doctorate recipients, 
both federally and nonfederally funded, entered industry and, when they did, they 
disproportionately got jobs at large and high-wage establishments in high-tech and 
professional service industries. Although Ph.D. recipients spread nationally, there was also 
geographic clustering in employment near the universities that trained and employed the 
researchers. We also show large differences across fields in placement outcomes. 



Ten years ago, Jack Marburger challenged
academics to provide scientific evidence
about the impact of research investments
(1). The United States Congress has been
even more insistent: requiring the National 
Science Foundation to “better articulate the 
value of grants to the national interest.” The aim 
of the current study is to investigate the labor 
market outcomes of doctoral recipients. 

Little is known about where research-funded 
Ph.D.’s go when they graduate and enter the 
private sector, and even less is known about the 
characteristics of the businesses that employ 
them. Thus, it has been difficult to capture the 
human dimension of the impact of research 
on the economy. What little evidence there is 
has been based on an analysis of patent clusters 
(2, 3); the geographic and industry placement of
new Ph.D.’s (4-6); or on bibliometric approaches
linking grants, patents, and publications (7, 8).
One noteworthy exception has been expensive:
the United Kingdom spent more than £34 million
(U.S. $51 million) in explicit costs, and much
more in implicit costs, to generate almost 7000
case studies. Unfortunately, they lack a common
framework or shared standards of evidence and
presentation. Consequently, the extent to which
this type of information provides rigorous, sys-
tematic, aggregate insights into economic value
is far from clear (9).

We drew on recent investments to build 
administrative data that cover researchers sup- 
ported by both federally and nonfederally funded 
grants in eight major universities that are members 
of the Committee on Institutional Cooperation 
(CIC) (10). Participating universities were Indiana,
Iowa, Michigan, Minnesota, Ohio State, Purdue,
Penn State, and Wisconsin. Those data, derived
from the CIC’s UMETRICS project (11), provided
the share of time charged to funded research
projects by all people employed under them (from
undergraduate and graduate students to post-
docs to staff and faculty), as well as purchases
for those projects. Graduate students were linked
to their dissertations (from ProQuest) and to
all subsequent employers and earnings in the
United States through matches to Census Bureau 
data. 

We documented the 2010-2012 earnings and 
placement outcomes of people receiving doctorates 
in 2009-2011. The universities have provided 
identifiers that allow the UMETRICS data to 
be linked to administrative and survey data 
housed at the U.S. Census Bureau under strict 
confidentiality protocols. The data are protected 
by law and are for statistical use only (anonymized 
unique identifiers are used for match keys), and all 
results are reviewed to ensure that no identifiable 
information is disclosed. We performed two 
distinct, but related, analyses of the jobs ob- 
tained by doctoral recipients. The first describes 
the characteristics of the establishments and 
firms where people obtained jobs. The second 
emphasizes the earnings of individuals at those 
jobs. It should be noted at the start that these 
analyses are designed to be descriptive and do 
not demonstrate causation. 

Placement data were obtained from links to 
two files derived from administrative data and 
augmented by survey data: (i) the Business Register 
(BR), which is the universe of U.S. nonagricultural 
firms and associated establishments, and (ii) the 
Longitudinal Business Database (LBD), which 
contains longitudinally linked data for all firms 
and associated establishments with paid emp-
loyees in the United States (12, 13). These files
were used to describe the characteristics of 
the establishments and firms that employ the 
UMETRICS doctoral recipients covered by the 
data (an establishment is the physical place 
where business is conducted and the unit of 
observation at which industry and geographic 
location are defined; firms can own one or more 
establishments). The matched data were used to 
describe the sector and detailed industry classi- 
fication of each establishment at which the 
recipients worked, the geographic location of 
their place of work, as well as characteristics 
related to productivity, such as size and payroll 
per worker (14). Age was determined at the firm
level, and establishments owned by R&D per- 
forming firms were identified from the Business 
Research & Development and Innovation Survey 



Table 1. Postgraduation employment of UMETRICS doctoral recipients who were paid by research 
grants and left the university between 2009 and 2011. The national workforce distribution is calculated 
from all employment in all establishments covered by the Census’s LBD between 2010 and 2012. 

Doctoral recipients placed in sector (%)

                        Industry
Locale and small        R&D firms   Non-R&D firms   Academia   Government   All
Placed within sector    17.0        21.7            57.1       4.1          100.0
National sample (M)     10.8        75.0            10.7       3.5          100.0
Of those in sector,
percent placed:
  Within 50 miles       10.1        23.5            8.9        18.2         12.7
  Within state          16.6        36.0            18.0       25.8         22.0

(15). The variables we analyzed, the level at
which they were measured, and their source
are listed in table S1.

Individual earnings data were derived from 
links to administrative records. Details of the 
data construction and links are in the supple-
mentary materials (SM), including fig. S1 and
table S2, A and B. De-identified versions of the 
UMETRICS data will be made available to the 
scholarly research community through a con- 
fidentially protected virtual digital enclave being 
developed at the Institute for Research on In- 
novation and Science (IRIS) (iris.isr.umich.edu) 
(10). Data integrated with census data on people 
and on their employers and their employer’s 
characteristics will be made available at the
Federal Statistical Research Data Centers through
a partnership between IRIS and the U.S. Census 
Bureau (www.census.gov/ferdc) under strict confi- 
dentiality protections. 

There were 3197 graduate students on research 
payrolls at the sample universities in the period 
2009-2011 who received a doctoral degree during 
that period and who were employed at a different 
institution in subsequent years. The major sectors 
into which they flowed in the year after their 
separation from university employment are sum- 
marized in Table 1 and compared with the sectors 
of employment for the U.S. workforce as a whole, 
which provides a benchmark (but not a formal 
control) group. The majority of graduate students
(57.1%) went to academia, presumably many to
postdoctoral positions. A large percentage (38.7%)
found jobs in industry: about 17% in
establishments owned by firms that perform
research and development (R&D) (for comparison,
10.8% of the U.S. workforce is employed in such
establishments) and about 21.7% in establishments
owned by firms that do not perform R&D (versus
75.0% of the U.S. workforce). Only a small
percentage (4.1%) entered government.
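
The comparison with the national benchmark can be made explicit by dividing each placement share by the corresponding national share. The short sketch below does this with the percentages from Table 1; the function name `representation_ratio` is our own label for illustration and is not a term used in the analysis.

```python
# Sector shares (%) copied from Table 1.
placed = {"R&D firms": 17.0, "Non-R&D firms": 21.7,
          "Academia": 57.1, "Government": 4.1}
national = {"R&D firms": 10.8, "Non-R&D firms": 75.0,
            "Academia": 10.7, "Government": 3.5}


def representation_ratio(sample_share, benchmark_share):
    """Ratio of the doctoral-recipient share to the national workforce share."""
    return sample_share / benchmark_share


ratios = {sector: representation_ratio(placed[sector], national[sector])
          for sector in placed}

# The industry total quoted in the text is the sum of the two firm types.
industry_share = placed["R&D firms"] + placed["Non-R&D firms"]

for sector, ratio in ratios.items():
    print(f"{sector}: {ratio:.2f} times the national share")
print(f"Industry total: {industry_share:.1f}%")
```

A ratio above 1 marks a sector that receives a larger share of doctoral recipients than of the national workforce; because the benchmark is not a formal control group, the ratios are descriptive only.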

Evidence that research funding to these uni- 
versities provides training to a workforce that 
participates in a national (and likely interna- 
tional) labor market is shown in Table 1. For 
each of the universities in our sample, more 
than one in five doctoral recipients stayed in 
the state in which the university was located, 
and about 13% stayed within 50 miles of the
university. More doctoral recipients stayed in
the university’s state than moved to any other
single state, with the exception of California,
which received more students from two of our
eight universities. The results are substantially
unchanged when the sample is restricted to
those supported only by federal research grants
(table S3).

Fig. 1. UMETRICS doctoral recipients are placed at establishments
that are larger and have higher payrolls per worker. Medians are dashed
inner lines, and means are solid outer lines. The standard deviations in
employment at establishments that employed UMETRICS doctoral recipients, at all U.S.
establishments owned by R&D performing firms, and all U.S. establishments are
6407, 3661, and 2362, respectively; the standard deviations in annual payroll per
worker are $120,199; $56,252; and $44,327, respectively; the differences in
employment size and payroll per worker are statistically significant. Annual payroll
per worker is the average payroll (the total payroll divided by the number of
employees) across all employees at the three types of establishments: all U.S.
establishments, all U.S. establishments owned by firms that perform R&D, and
the establishments that employed UMETRICS doctoral recipients (regardless
of whether they are owned by firms that perform R&D). National and R&D
establishments are weighted by total establishment employment, whereas
doctoral recipient establishments are weighted by the number of doctoral
recipients employed. Values for annual payroll per worker are U.S.$1 × 1000.

Fig. 2. Annual payroll per worker at establishments that employed UMETRICS doctoral recipients, establishments owned by firms that
perform R&D, and all U.S. establishments. Values for average annual payroll per employee are U.S.$1 × 10,000.

The data permit an even deeper examination 
of the geographic destinations of the doctoral 
recipients. For example, 19% of the doctoral re- 
cipients who left their university’s state headed 
for California, which has only 12% of the U.S. 
population. We expect that this is partially due to 
the fact that more R&D is conducted in California 
than in any other state. The states to which people
moved are shown in fig. S2, and a comparison of
the locations of Ph.D. recipients with R&D and
population by state is shown in table S4. The
other major destination states in terms of numbers 
of graduates among people who leave the state 
where they trained were Illinois, New York, Texas, 
North Carolina, Massachusetts, Pennsylvania, and 
Washington (state), which either have a large 
share of R&D relative to their populations or 
are very populous. 

It is also possible to examine the specific in- 
dustries that were most likely to employ the 
sample of doctoral recipients paid by research 
grants (table S5); results for doctoral recipients 
supported by federally funded projects are reported 
in table S6. The employers were much more likely 
to be in industries such as engineering, or high- 
tech and professional service fields (including 
medicine) than U.S. employers at large: For 
example, the shares of doctoral recipients employed 
in pharmaceutical and medicine manufactur- 
ing, semiconductors, and computer systems design 
were between 4 and 19 times the U.S. average. 
Uncommon employment destinations for doctoral 
recipients were restaurants and eating places and 
grocery stores. 



The establishment-level data also permitted 
an examination of the differences between the 
employers of the UMETRICS doctoral recipients 
being studied, the typical employer across the 
U.S. as a whole, and the typical U.S. establish- 
ment owned by a firm performing R&D (Fig. 
1). Because the distribution of establishments 
is highly skewed, it is informative to provide 
information on both the means and the medians. 
The median establishment that employed the 
doctoral recipients in our sample has more 
employees (at 1084) than the national median 
(at 73) or even than establishments owned by 
firms performing R&D (341) (Fig. 1). We used 
payroll per worker (constructed by dividing 
total payroll by the number of employees) to mea- 
sure the average earnings at establishments. The 
median establishment that employed UMETRICS 
doctoral recipients also tended to have a higher 
payroll per worker (over $90,000) than the median 
U.S. establishment, which has a payroll per worker 
of just over $33,000, or the median establishment 
owned by a firm performing R&D, which has a 
payroll per worker of just less than $61,000. The 
same results hold when we used the mean as the 
measure of central tendency. 
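
As a minimal sketch of the two steps described above, the code below computes payroll per worker for a few establishments and then compares an unweighted median with an employment-weighted median, the weighting that the Fig. 1 caption describes for the national and R&D groups. The establishment figures are hypothetical, and the rule for the weighted median (the first value at which the cumulative weight reaches half the total) is an assumption; the exact convention used in the analysis is not stated here.

```python
def payroll_per_worker(total_payroll, employees):
    """Average earnings at an establishment: total payroll / employees."""
    return total_payroll / employees


def weighted_median(values, weights):
    """Smallest value whose cumulative weight reaches half the total weight."""
    pairs = sorted(zip(values, weights))
    half = sum(weights) / 2.0
    running = 0.0
    for value, weight in pairs:
        running += weight
        if running >= half:
            return value
    raise ValueError("values and weights must be non-empty")


# Hypothetical establishments: (total annual payroll in US$, employees).
establishments = [(600_000, 20), (4_500_000, 100), (90_000_000, 1000)]

ppw = [payroll_per_worker(payroll, staff) for payroll, staff in establishments]
employment = [staff for _, staff in establishments]

unweighted = weighted_median(ppw, [1] * len(ppw))  # each establishment counts once
weighted = weighted_median(ppw, employment)        # each worker counts once

print(ppw, unweighted, weighted)
```

In this toy example the large establishment dominates the weighted median, which illustrates why the choice of weights matters when the size distribution is highly skewed.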

The data were rich enough to go beyond sum- 
mary statistics and to characterize the distri- 
bution of each measure for all establishments 
employing the workers under study. In particular, 
we calculated the payroll per worker for each 
establishment (regardless of R&D status) employing 
individuals in our sample, all U.S. establishments, 
and all U.S. establishments owned by firms 
performing R&D (regardless of whether they 
employ the doctoral recipients in our sample) 
and report the distribution of each in Fig. 2. 
Although only 8.3% of the U.S. workforce and
24% of workers at establishments owned by
firms performing R&D worked at establishments
with payrolls per worker in excess of $100,000,
more than half (51%) of the sample of doctoral
recipients did. The results are consistent with the
hypothesis that doctoral recipients are placed
in establishments with characteristics usually
associated with greater productivity.

It is possible to examine individual earnings 
outcomes, as well as the placement outcomes 
described above. The data also permit the sta- 
tistical analysis of the relation between a re- 
searcher’s field of study (based on ProQuest 
dissertation data) and subsequent placement 
and individual earnings. 

The results of the analysis of individual earnings outcomes in the year after
Ph.D. receipt are reported in Fig. 3. Descriptive statistics and regression
results are shown in tables S7 and S8. Although earnings are an imperfect
measure of the value of skills, especially for people still investing in their
human capital, the labor economics literature finds a strong correlation
between skills and earnings (16). The two fields with the highest earnings are
mathematics and/or computer sciences and engineering, with mean earnings in
excess of $65,000. Although mean earnings for doctoral recipients are low in
biology at $36,000, this may be due to many taking jobs as postdoctoral
researchers in life sciences. If the sample is restricted to industry earnings,
the average earnings increase by one quarter (although the gap varies by
field), with the highest earnings in mathematics and/or computer sciences
(almost $90,000) and engineering (almost $80,000).

Placements are reported in the four right 
panels of Fig. 3. Doctoral recipients with degrees 
in engineering are most likely to go to work in 
industry, followed by those with degrees in math 
and/or computer science. New degree holders 
in engineering are by far the most likely to go 
to establishments that are owned by firms 
performing R&D. In all fields, doctoral recipients 
going into industry are likely to go into estab- 
lishments with high payroll per worker [above 
the median for the establishments within their 
six-digit industry code (see SM)]. The recent 
economics literature has emphasized the importance
of young firms in contributing to economic
growth (17, 18). We examine placements and find
that engineers, physicists, and computer scientists
are most likely to go to establishments of
young firms.

Fig. 3. The annual earnings and placement of doctoral recipients supported by grants vary by field. Young firms are defined to be those <5 years
old. High-payroll per worker establishments are defined as those with a payroll per worker above the median for the establishments within their six-digit
industry. Means and standard errors (error bars) are shown for each variable. [Panel titles: High Wage Firm; Young Firm.]

Just as it was possible to show the distribution 
of average payroll per worker for employers, it is 
also possible to describe the distribution of 
individual earnings. One year after leaving the 
university, doctoral recipients placed in industry 
had considerably higher earnings than those who 
went to government or academia, and consid- 
erably more doctoral recipients placed in aca- 
demia had earnings below $50,000 per year (Fig. 4). 
There is suggestive evidence that at least part 
of that may be due to doctoral recipients taking 
postdoctoral research positions; there is a notice- 
able heaping of the earnings distribution at 
just under $50,000 for those in biology, chem- 
istry, and health disciplines. However, the earn- 
ings growth (albeit from a lower base) was still 
robust in all sectors 2 years after doctoral recip- 
ients left the university, as shown by the dis- 
tribution of earnings growth. 
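
The caption of Fig. 4 describes its curves as probability densities estimated with a Gaussian kernel and a bandwidth of $10,000 for earnings. The sketch below shows one way such a smoothed share can be computed. The earnings values are hypothetical, and treating the bandwidth as the standard deviation of each kernel is an assumption on our part.

```python
import math


def gaussian_kde(samples, bandwidth, grid):
    """Gaussian kernel density estimate of `samples`, evaluated on `grid`."""
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))
    density = []
    for x in grid:
        total = sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples)
        density.append(norm * total)
    return density


# Hypothetical annual earnings (US$) with a cluster just under $50,000.
earnings = [42_000, 45_000, 47_000, 48_000, 49_000, 65_000, 88_000, 95_000]

step = 1_000
grid = list(range(0, 200_001, step))
density = gaussian_kde(earnings, bandwidth=10_000, grid=grid)

area = sum(density) * step                 # Riemann sum, should be close to 1
mode = grid[density.index(max(density))]   # earnings level with the highest density

print(f"area under curve: {area:.3f}, mode near ${mode:,}")
```

A cluster of observations just below a round figure shows up as a peak in the estimated density, which is the kind of heaping the text describes for some disciplines.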

This work takes a first step toward describing 
the links between research funding and the 
economy by tracing the flows of doctoral re- 
cipients employed by research grants subse- 
quent to their separation from the university 
that employed them. The analysis shows that
many doctoral recipients who were employed
by funded research projects moved into the
nonacademic sector and that, when they did,
they disproportionately got jobs at establishments
with high payroll per worker and in high-tech
and professional service industries. Although
the results are descriptive and not causal, the 
findings are consistent with sociological research 
regarding knowledge flows. A major way in 
which knowledge is transmitted from research 
institutions to the economic marketplace is 
through the placement of people at businesses 
that draw on that knowledge (10). As research 
has shown, and as Oppenheimer pointed out, 
the best way to send knowledge is to wrap it 
up in a person (19). Higher earnings and place- 
ment in large establishments with high payroll 
per worker, and in establishments owned by 
firms performing R&D, all characteristics cor- 
related with higher productivity, are consistent 
with that view (20). 

As with any initial examination of new data, 
there are a number of issues and future ex- 
tensions to note. There is a need to develop a 
statistical framework to make causal inferences, 
which involves identifying appropriate counter- 
factuals or quasi-experimental variation. The 
Census Bureau links will make it possible to
study both longer-term career trajectories and
the characteristics of businesses started by
researchers and to compare those with the careers
of different types of workers and other business
startups in the United States. In addition, the
type of analysis we do here for Ph.D. recipients
can be extended to study the career outcomes of
other groups involved in research, particularly
postdoctoral researchers, graduate students who
do not get doctoral degrees, undergraduate
students, research staff, and people employed
by nonfederal projects. As the database
expands, researchers can begin to study how
outcomes relate to whether funding is federal or
nonfederal.

There are important caveats, however. Although
the approach is extensible to all research universities,
the institutions we studied should not be
viewed as representative of all academic
research institutions. They are large public
institutions in the Midwest, and many have large
engineering programs and medical schools.
Also, the use of U.S.-based administrative and 
survey data limits the ability to track students 
who leave the United States. The analysis is 
explicitly descriptive in nature and is not in- 
tended to make any causal assertions. However, 
because the data will be available at IRIS and 
the Federal Statistical Research Data Centers, 
the research community can build on this infra- 
structure to advance the science of science 
and to provide policy-makers with research- 
backed tools to assess the effects of investments 
in science. 




Fig. 4. Annual earnings (U.S.$1 × 1000) and earnings growth of
UMETRICS doctoral recipients by sector and discipline. The
figure plots the smoothed share of UMETRICS doctoral recipients
(the probability density estimated using a Gaussian kernel model) at
each level of earnings or earnings growth (with bandwidths of
$10,000; $10,000; and 25%, respectively). Individual earnings data
are derived from a match to W-2 earnings data. [Axis label: Earnings (US$).]
SOCIAL BEHAVIOR

Sexual fidelity trade-offs promote
regulatory variation in the prairie
vole brain

Mariam Okhovat,1 Alejandro Berrio,1 Gerard Wallace,1
Alexander G. Ophir,2 Steven M. Phelps1*

Individual variation in social behavior seems ubiquitous, but we know little about how it relates
to brain diversity. Among monogamous prairie voles, levels of vasopressin receptor (encoded
by the gene avpr1a) in brain regions related to spatial memory predict male space use and
sexual fidelity in the field. We find that trade-offs between the benefits of male fidelity and
infidelity are reflected in patterns of territorial intrusion, offspring paternity, avpr1a
expression, and the evolutionary fitness of alternative avpr1a alleles. DNA variation at the
avpr1a locus includes polymorphisms that reliably predict the epigenetic status and neural
expression of avpr1a, and patterns of DNA diversity demonstrate that avpr1a regulatory
variation has been favored by selection. In prairie voles, trade-offs in the fitness consequences
of social behaviors seem to promote neuronal and molecular diversity.



Social behavior emerges from the complex, dynamic, and often strategic
interactions of individuals, a complexity that places it among the most
challenging and interesting behaviors to study. Neuroscience has elucidated
many mechanisms of social behavior (1, 2). In parallel, evolutionary biology
has outlined how social interaction can promote variation within a species
(3-5). Frequency- or density-dependent selection, for example, maintains
individual differences in the parental care of sunfish (3), the territorial
defense of lizards (4), and the cannibalistic behavior of tadpoles (5). Among
humans, similar forces have been proposed to explain differences in
personality, resilience, and psychiatric risk (6-8). Given that social
diversity is central to behavioral ecology, social psychology, and mental
health, it is surprising that we know so little about natural variation in
the social brain, how it emerges from the interaction of genetic and
epigenetic processes, or how it has been sculpted by evolutionary forces.

We explored individual differences in neuronal gene expression in the
monogamous prairie vole, Microtus ochrogaster, a small North American rodent
whose males and females form pair bonds and share parental care (9). Prairie
vole pair-bonding is governed by multiple modulators and brain regions
(2, 10, 11). Among these, the vasopressin 1a receptor (V1aR, encoded by
avpr1a) is particularly well studied (2, 11-15). V1aR expression can vary
profoundly across individual prairie voles (12), and its abundance in a
spatial-memory circuit predicts sexual fidelity in males (13, 14) but not
females (supplementary materials), a finding consistent with male-specific
vasopressin effects in other contexts (15). We used the relationship between
avpr1a expression and male fidelity to examine how social forces contribute
to brain diversity. Specifically, we asked whether the fitness consequences
of male sexual fidelity promote genetic and epigenetic variation in avpr1a.

Although prairie voles are socially monogamous, they are not sexually
exclusive (16). Approximately 25% of young are conceived outside a pair bond
(termed extra-pair fertilizations, or EPFs). Male fidelity is often thought
to depend on spatial strategies that balance the demands of mate-guarding
against the value of mating multiply (17, 18). To examine the relationship
between space use and sexual fidelity among male prairie voles, we estimated
the intensity of a male’s space use by fitting kernel density estimates to
animal positions measured over several weeks by radiotelemetry (Fig. 1, A and
B, and fig. S1). By overlaying these maps of space-use intensity, we could
estimate how often males encounter other individuals either at home or in
neighboring territories. We found that the spatial behavior of EPF males
differs from that of males who sire young only with a partner (intra-pair
fertilizations, IPF). EPF males have larger home ranges (P < 0.05; Fig. 1C),
and they more frequently encounter extra-pair females (P < 0.0001; Fig. 1D),
intrude on territories (P < 0.01; Fig. 1E), and are intruded upon (P < 0.01;
Fig. 1F). The rate at which a male intrudes on a neighbor’s territory is
correlated with the rate at which he encounters extra-pair females [Pearson’s
correlation coefficient (r) = 0.69, P < 0.0001], but also with the rate at
which he is intruded upon by other males (r = 0.83, P < 0.0001; Fig. 1G).
Overall, the data suggest that venturing away from a male’s core home range
increases encounters with both extra-pair females and their aggressive mates;
these intrusions may offer the opportunity for extra-pair paternity, but they
also increase the rates at which a male’s home range is visited by
neighboring males. This pattern is consistent with data suggesting that
pair-bonded EPF males are more likely to be cuckolded (14). Increasing the
extra-pair female encounter rate seems to come at the expense of intra-pair
mate-guarding.
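
For readers less familiar with the statistic quoted above, the following sketch computes Pearson’s correlation coefficient (r) from paired rates. The intrusion and visitation rates are invented for illustration and are not the field data; only the formula itself is standard.

```python
import math


def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return covariance / (spread_x * spread_y)


# Hypothetical per-male rates (arbitrary units), one entry per male.
intrusion_rate = [0.1, 0.4, 0.5, 0.9, 1.2, 1.5]
visits_received = [0.2, 0.3, 0.7, 0.8, 1.1, 1.6]

r = pearson_r(intrusion_rate, visits_received)
print(f"r = {r:.2f}")
```

A value of r near 1 means that males who intrude more are also visited more, which is the pattern reported in Fig. 1G; the coefficient alone does not say which behavior drives the other.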

Among prairie voles, we find that neuropeptide receptors show profound
variation in nodes of a spatial memory circuit including the hippocampus,
laterodorsal thalamus (LDThal), and retrosplenial cortex (RSC; Fig. 1H).
Remarkably, variation in each of these regions predicts aspects of space use
and paternity in the field (13, 19). The relationship between spatial memory
and sexual fidelity is not clear, but males with low V1aR in RSC or LDThal
have been hypothesized to have a poor memory for locations of aggressive
interactions, a cognitive strategy that could promote territorial intrusion
and extra-pair encounters (14). In contrast, a male with abundant V1aR may
better monopolize a mate but might encounter fewer extra-pair females. To
look for evidence of fitness trade-offs that could promote forebrain
diversity, we examined the relationship between RSC-V1aR and our measures of
space use. As reported previously, faithful IPF males have more RSC-V1aR than
EPF males [P < 0.001, Fig. 1, I to K; (13)]. Low levels of RSC-V1aR were also
associated with high intrusion rates (RSC, P < 0.01; pairing status,
P < 0.0001; RSC × status, P < 0.05; Fig. 1L) and poor mate-guarding (male
visits received: RSC, P < 0.05; pairing status, P < 0.0001; RSC × status,
P > 0.10). V1aR levels in another node in this circuit (the LDThal) also
predicted sexual fidelity and space use, whereas brain regions associated
with pair-bonding and aggression (the ventral pallidum and lateral septum)
did not [supplementary materials (13)]. These data suggest that trade-offs
between the fitness benefits of intra-pair and extra-pair paternity could
contribute to diversity in this memory circuit.

Fig. 1. Male sexual fidelity predicted by patterns
of space use, social interaction, and V1aR.
(A and B) Intensity of male space use. The x and y
axes are enclosure dimensions (20 m × 30 m);
the height and color of the peaks indicate
probability densities. A focal male is indicated as a
solid peak; nonfocal males are indicated as
blue-contoured peaks. Single males are not shown.
Arrows indicate the regions of likely intrusion by
the focal male. (C to F) EPF and IPF males differ
in space use. (G) Rates of intrusion and of male
visitation are correlated. (H) Regions of a
spatial-memory circuit (31) vary in receptors for vasopressin
(red) or oxytocin (blue) (13, 19). Abbreviations
are as follows: ERC, entorhinal cortex; Hipp,
hippocampus; AThal, anterior thalamus. (I to K)
Autoradiograms for V1aR in the RSC. RSC-V1aR
abundance (in dissociations per minute per
milligram of tissue) predicts sexual fidelity and (L)
intrusion rate. All bars show mean ± SE. *P < 0.05,
**P < 0.01, ***P < 0.001.

Fig. 2. SNPs in regulatory regions of the avpr1a
locus predict RSC-V1aR. (A) DNAse I
hypersensitivity in Mus brain and mammalian conservation
(22). (B) Structure of prairie vole avpr1a locus
(exons, blue; microsatellites, white). (C)
Association of avpr1a SNPs with RSC-V1aR abundance.
Each bar is a SNP; the x axis depicts position
along the avpr1a locus; the y axis depicts strength
of association [-log10(P)]. The lower horizontal gray
line shows uncorrected α = 0.05; the upper
horizontal gray line shows corrected α = 0.00054.
(D) Fold enrichment by H3K4me1 ChIP-seq
compared to input chromatin. Horizontal bars mark
peaks corresponding to putative enhancers.

Fig. 3. Genotype differences in regulation of avpr1a. (A) Homozygotes differ in abundance of
V1aR (dissociations per minute per milligram) and (B) avpr1a mRNA in the RSC. (C) RSC avpr1a
transcript abundance correlates with V1aR protein. (D) Fixed (gray) and polymorphic (blue) CpG
sites along avpr1a. The red bars are SNPs associated with RSC-V1aR. The shaded gray box indicates
a putative intron enhancer. A cluster of CpG sites were selected for pyrosequencing, including
polymorphic CpG SNP 2170 in red. (E) HI/HI males have fewer CpG sites in the intron and (F) lower
levels of enhancer methylation. (G) RSC enhancer methylation correlates with V1aR abundance
(R² = 0.70, P < 0.0001). Bars are means ± SE. ***P < 0.001.

In order for selection to have promoted neuronal diversity, such variation
must be heritable. We asked whether single-nucleotide polymorphisms (SNPs) in
avpr1a predicted individual differences in V1aR abundance. We sequenced ~8 kb
of the avpr1a locus (Fig. 2 and fig. S2) from lab-reared males with
substantial field data [Fig. 1 (13)] and from wild-caught adults. Of 151
SNPs, 4 tightly linked polymorphisms predicted RSC-V1aR (Fig. 2,
multiple-test corrected α = 5.4 × 10⁻⁴). These SNPs were upstream of the
coding sequence (SNP -1392, P = 6.3 × 10⁻⁶), in the intron (SNPs 2170 and
2676, P = 4.7 × 10⁻⁶), and in the second exon (SNP 3506, P = 5.0 × 10⁻⁵). We
refer to the genotypes defined by these linked SNPs as HI (high)- and LO
(low)-RSC alleles. The effects of HI and LO alleles were stronger among
lab-reared animals (P < 0.0001) than wild-caught animals (P < 0.05; genotype ×
rearing P = 0.002; fig. S3), which suggests that population structure or
developmental environment may influence cortical V1aR. We also found that a
distinct SNP predicted V1aR in the LDThal (SNP 5168, P = 3.6 × 10⁻⁴), but
none of the 151 SNPs predicted V1aR in the ventral pallidum or lateral septum
(fig. S4). Thus, V1aR levels in regions implicated in spatial memory and
sexual fidelity were linked to avpr1a sequence variation, whereas regions
important in pair-bonding and aggression were not.
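
The association strengths plotted in Fig. 2C are expressed as -log10(P), so a smaller P value gives a taller bar. As a small illustration, the sketch below converts the P values reported in the text for the four RSC-associated SNPs and compares them with the corrected threshold given in the Fig. 2 caption. It does not reproduce the association tests themselves, and the variable names are ours.

```python
import math

ALPHA_UNCORRECTED = 0.05   # lower gray line in Fig. 2C
ALPHA_CORRECTED = 5.4e-4   # upper gray line in Fig. 2C (0.00054)

# P values reported in the text for the RSC-associated SNPs.
rsc_snps = {
    "SNP -1392": 6.3e-6,
    "SNP 2170": 4.7e-6,
    "SNP 2676": 4.7e-6,
    "SNP 3506": 5.0e-5,
}


def association_strength(p_value):
    """Height of a bar in a plot of -log10(P)."""
    return -math.log10(p_value)


threshold = association_strength(ALPHA_CORRECTED)
significant = {name: association_strength(p)
               for name, p in rsc_snps.items() if p < ALPHA_CORRECTED}

print(f"corrected threshold: {threshold:.2f}")
for name, strength in significant.items():
    print(f"{name}: {strength:.2f}")
```

All four SNPs clear the corrected line by a wide margin on this scale, consistent with their description as robust predictors of RSC-V1aR.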

We examined the stability and specificity of the HI- and LO-RSC associations
with a breeding design that controlled for potential confounds of our initial
study. We obtained a new genetic stock from a third site >100 miles from
prior sites. Heterozygous HI/LO parents were crossed to produce siblings that
differed in their genotypes but shared a common genetic background, rearing
environment, and lack of sexual experience. We again found that HI and LO
alleles influenced V1aR in the RSC (P < 0.0001; Fig. 3A), but not in other
brain regions (fig. S5). Thus, our data demonstrate a replicable, robust, and
specific association between the HI-RSC allele and high RSC-V1aR expression.
However, differences between wild-caught and lab-reared animals (fig. S3), as
well as previously reported developmental manipulations (20), suggest that
epigenetic variation may also be at play.

If individual differences in RSC-V1aR abundance are due to differences in the
regulation of avpr1a, then HI/HI and LO/LO genotypes should differ in avpr1a
transcript abundance. We dissected the RSC of the lab crosses reported above
and used quantitative polymerase chain reaction to quantify avpr1a mRNA.
Genotypes differed significantly in avpr1a transcript abundance (ΔCt versus
β-actin, P < 0.001, Fig. 3B). Moreover, individual differences in avpr1a mRNA
were strongly associated with RSC-V1aR protein [linear regression coefficient
(R²) = 0.75, P < 0.0001, Fig. 3C].

Fig. 4. Selection maintains regulatory variation at avpr1a. (A) Context-dependent selection on HI-RSC and LO-RSC alleles in the field. (B) avpr1a has more
intermediate frequency alleles than do neutral markers. (C) Tajima’s D value is significantly positive for avpr1a but not for neutral loci. (D) Regulatory regions of
avpr1a had higher ratios of within:between species differences than nonregulatory regions. *P < 0.05, **P < 0.01, not significant = P > 0.10.

To determine whether any RSC-associated SNPs were within DNA sequences that
might contribute to avpr1a regulation, we performed chromatin
immunoprecipitation sequencing (ChIP-seq) targeting the histone modification
H3K4me1, a marker for regulatory sequences known as enhancers (21). We
dissected RSC samples from eight new lab-reared animals. Within a 25-kb
sequence centered on the avpr1a translation start site (Fig. 2D), the H3K4me1
mark was specifically associated with two regions within the avpr1a locus
[P < 1 × 10⁻⁷, false discovery rate (Q) < 0.0001; supplementary materials].
One putative enhancer was in the center of the intron, including both intron
SNPs of the HI/LO alleles; the second overlapped the second exon and included
the fourth linked SNP (Fig. 2C). Three of the polymorphisms that define the
HI and LO alleles are within putative enhancer regions, and the fourth is
within a conserved deoxyribonuclease I (DNAse I) hypersensitive site
[Fig. 2A (22)]. Thus, all four RSC-associated SNPs coincide with markers of
transcriptional regulation.

We next asked whether differences in RSC avpr1a transcript and V1aR protein
abundance reflected differences in the epigenetic state of the avpr1a locus.
We focused on the putative intron enhancer: This sequence had strong evidence
of H3K4me1 enrichment and included the two SNPs most strongly linked to
RSC-V1aR. SNP 2170 proved to be a G/T polymorphism that altered the presence
of a CpG site, a common target of DNA methylation (23). Moreover, this
CpG/CpT polymorphism is linked to a cluster of CpG polymorphisms within the
enhancer (Fig. 3D). HI-RSC alleles have fewer CpG sites than LO alleles
(P < 0.0001, Fig. 3E), suggesting fewer opportunities for methylation. We
isolated DNA from the RSC, treated it with bisulfite, and performed
pyrosequencing of this enhancer. HI/HI animals had less enhancer methylation
than LO/LO animals (P < 0.0001, Fig. 3F). Genotypes also differed in enhancer
methylation if we focused solely on nonvariable CpG sites [mean (μ) ± SE,
HI/HI 67.6 ± 1.6%, LO/LO 75.6 ± 1.3%; P = 0.001]. Moreover, avpr1a enhancer
methylation is significantly associated with RSC-V1aR abundance (P < 0.0001,
Fig. 3G). Methylation at noncoding CpG sites is known to recruit
methyl-binding proteins, histone de-acetylases, and other silencing proteins
(24); our data suggest that SNP 2170 and neighboring CpG polymorphisms may
alter the function of an intron enhancer by changing the number of CpG sites
available for methylation.

Our molecular data indicate that specific alleles are robust predictors of
RSC-V1aR, and they suggest mechanisms by which specific SNPs might exert
influence on avpr1a expression. If genetic differences in RSC-V1aR are
adaptive (a “balanced polymorphism” of the brain), we might expect
differences in how HI- and LO-RSC alleles gain fitness. Using data from
lab-reared animals monitored in the field (Fig. 1), we calculated the number
of embryos that each male sired either with a partner (IPFs) or nonpartner
(EPFs) and estimated the relative fitness of HI and LO alleles in each
context. Although the alleles had similar fitness overall, selection favored
HI alleles in the context of IPFs and LO alleles in the context of EPFs
(Fig. 4A, P < 0.05). Thus, fluctuations in the defensibility of females could
profoundly influence the strength and direction of selection on HI and LO
alleles. Prairie voles exhibit wide fluctuations in population density,
ranging from ~25 to 600 voles per hectare in a year (25); high densities
increase the rate of extra-pair interactions (26) and reduce the
defensibility of prairie vole females (27). Manipulative studies will be
needed to test whether fluctuations in population density or allele frequency
promote variation in avpr1a and related behaviors.

If genetic variation at avpr1a produces variation in memory regions, and this
in turn influences space use and sexual fidelity, then over time we expect
selection to have influenced patterns of avpr1a nucleotide variation. We
tested for a history of balancing selection by comparing the frequencies of
SNPs at avpr1a to three putatively neutral nuclear loci among our original
wild-caught samples. We found that the avpr1a locus was strongly skewed
toward an excess of intermediate-frequency alleles, a classic signature of
balancing selection (Fig. 4B, likelihood ratio = 120.3, df = 4,
P = 4.7 × 10⁻²⁵). Similarly, avpr1a had a positive Tajima’s D value
(P < 0.05) (28), whereas our neutral loci had negative values (P > 0.10,
Fig. 4C). Lastly, a Hudson-Kreitman-Aguadé test (29) comparing the number of
within- and between-species differences indicated an excess of standing
variation within regulatory regions (defined by H3K4me1 ChIP-seq and DNAse
hypersensitivity; P < 0.01, Fig. 4D). We conclude that balancing selection
has actively maintained regulatory variation at the avpr1a locus. This
regulatory variation seems to be specifically associated with brain regions
related to memory and space use.
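
To make the link between intermediate-frequency alleles and a positive Tajima’s D concrete, the sketch below implements the standard Tajima (1989) statistic for a small alignment. The two toy alignments are hypothetical and far smaller than any real sample; they only show the sign of D under two extreme allele-frequency patterns and are not the avpr1a data.

```python
import math
from itertools import combinations


def tajimas_d(sequences):
    """Tajima's D for equal-length aligned sequences with no missing data."""
    n = len(sequences)
    length = len(sequences[0])

    # S: number of segregating (polymorphic) sites.
    seg_sites = sum(1 for i in range(length)
                    if len({seq[i] for seq in sequences}) > 1)
    if seg_sites == 0:
        raise ValueError("no segregating sites; D is undefined")

    # pi: mean number of pairwise differences.
    pairs = list(combinations(sequences, 2))
    pi = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs) / len(pairs)

    # Constants from Tajima (1989).
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / math.sqrt(variance)


# Hypothetical alignments: two common haplotypes versus only rare variants.
intermediate = ["AAAA", "AAAA", "GGGG", "GGGG"]
rare_only = ["GAAA", "AGAA", "AAGA", "AAAG"]

print(f"intermediate-frequency alleles: D = {tajimas_d(intermediate):.2f}")
print(f"rare alleles only:              D = {tajimas_d(rare_only):.2f}")
```

When every polymorphism sits at intermediate frequency, pairwise diversity exceeds what the number of segregating sites predicts and D is positive; when every variant is a singleton, D is negative. Assessing significance requires a comparison with neutral expectations, which this sketch does not attempt.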

These data provide a remarkably coherent perspective on the origin and
maintenance of diversity in the social brain. V1aR levels in memory
structures predict whether males will intrude on neighbors and gain
extra-pair paternity, or exclude intruders and improve intra-pair paternity.
Nucleotide polymorphisms within regulatory sequences robustly and
specifically predict V1aR variation in these same brain regions. Within the
RSC, we find that low-expressing alleles differ in CpG abundance and
methylation status. Because CpG sites can be gained or lost easily [~25% of
single nucleotide differences between humans and chimps, for example, consist
of the gain or loss of a CpG site (30)], we hypothesize that CpG
polymorphisms may often shape heritable variation in environmental
sensitivity. Genetic markers for this neuronal phenotype exhibit strong
evidence of balancing selection. Together these data suggest that trade-offs
in the fitness consequences of spatial behaviors promote diversity in the
social brain. By focusing on what would seem to be the simplest of social
phenotypes (the neural expression patterns of a single gene), we gain
insights into the complex interplay of forces that shape both gene function
and social evolution.